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Field of the Invention 

This invention relates to a method for producing functional, novel 
DNA sequences using biological selection of random nucleotide sequences, and to 
the DNA sequences so produced. 

Background of the Invention 

New methods to rearrange nucleotide sequences in DNA have 
enriched our understanding of how protein structure governs function. By 
monitoring the effects of particular substitutions on structure and activity one 
can explore putative catalytic mechanisms. The most direct approach is to select 
the region of a gene that codes for the active site of an enzyme and 
systematically substitute nucleotides based on a knowledge of the amino acid 
groups, the mechanism for catalysis, and three dimensional structure. This 
approach has been applied to diverse enzymes including: trypsin (Craik, et al., 
1985; see the appended citations), iysozyme (Perry <Sc Wetzel, 1984), and 
e-lactamase (Dalbadie-McFarland, et al., 1986) with considerable success. We 
demonstrate here an alternative strategy. Instead of making precise substitution 
based on detailed knowledge of structure and function, one can insert into genes 
stretches of nucleotides containing random sequences and use biological selection 
to obtain new proteins harboring a spectrum of substitutions. 

In our initial studies on the selection of nucleotide sequences from 
random populations, we examined the -35 region of the promoter of the gene for 



tetracycline resistance (Horwitz <5c Loeb, 1986; 1988b). We obtained 85 new active 
promoters, many of which bore little resemblance to the promoter consensus 
sequence and some of which were more active than the consensus sequence or the 
wild type tetracycline promoter. We have now remodeled the gene coding for 
S-lactamase^ by replacing DNA at the active site with random nucleotide 
sequences, and have selected functional active site mutants having altered 
catalytic activities. 

Summary of ^e Invention 

The invention provides, in one embodiment, a method of obtaining an 
oligonucleotide capable of carrying out a predetermined biological function. A 
heterogeneous pool of oligonucleotides, x + y + z nucleotides in length, is first 
generated. Each oligonucleotide has a 5' randomized sequence, x nucleotides in 
length, a central preselected sequence, y nucleotides in length, and a 3' 
randomized sequence, z nucleotides in length. The resulting heterogeneous pool 
contains nucleic acid sequences representing a random sampling of the 4 
possible sequences for oligonucleotides of the stated length. A random sampling 
of the heterogeneous pool of oligonucleotides is introduced into a population of 
cells that do not exhibit the predetermined biological function. The population of 
engineered ceils is then screened for a subpopuiation of cells exhibiting the 
predetermined biological function. From that subpopuiation of cells is isolated an 
oligonucleotide containing the preselected sequence and capable of carrying out 
the predetermined biological function. 

In a related embodiment, the heterogeneous pool of oligonucleotides 
may be generated from a biased mixture of nucleotides. The heterogeneous pool 
of oligonucleotides, n nucleotides in length, is synthesized from a mixture of 
nucleotides consisting essentially of a% adenine, t% thymidine, c% cytosine, and 
g% guanine, wherein a + £ + c + g= 100%, The resulting heterogeneous pool 
contains nucleid acid sequences representing a random sampling of the 4^ possible 
sequences for oligonucleotides of the stated length generated from nucleotides of 
the stated relative concentrations, A random sampling of the heterogeneous pool 
of oligonucleotides is introduced into a population of cells that do not exhibit the 
predetermined biological function. The population of engineered cells is screened 
for a subpopuiation of cells exhibiting the predetermined biological function, and 
from the subpopuiation of cells an oligonucleotide capable of carrying out the 
predetermined biological function is isolated. 

f 



Brief Description of the Drawings 

The details of typical embodiments of the present invention will be 
described in connection with the accompanying drawings in which: 

FIGURE 1 presents a schematic overview of the representative 
synthesis, transformation, and selection steps described in Example 1; 

FIGURE 2 is an RNA gel blot analysis quantifying tetracycline 
resistance gene transcription in various plasmid populations described in the 
Examples; 

FIGURE 3 shows the substitutions made in the wild type promoter 
sequence for the tetracycline resistance gene, as described in Examples 1 through 
5 and summarized in the Discussion following Example 5; 

FIGURE 4 shows the construction of plasmid pBDL, as described in 

Example 8; 

FIGURE 5 illustrates the overall scheme for the insertion of 
oligonucleotides containing random DNA inserts described in Example 8, with 
(A) Step I showing construction of the nonproducer strain pBNP, and (B) Step II 
showing replacement of the oligonucleotide in the nonproducer strain with random 
DNA sequences, wherein amp = ampieillin, tet = tetracycline, and n = unspecified 
bases; and 

FIGURE 6 shows the active site sequence substitutions obtained as 
described in Example 8, including the determined nucleotide sequence of each of 
the carbenicillin resistant mutants, and the deduced amino acid sequences. 

Detailed Description of the Invention 

The invention provides methods for producing novel, functional DNA 
sequences using biological selection. To carry out the present invention, a library 
of all possible DNA sequences is used to transform, transfect or infect suitable, 
competent host cells. Biological selection is used to isolate functional DNA 
sequences exhibiting a desired biological activity. The process of the invention is 
set forth in greater detail in the description and examples that follow. 

Synthesis of random DNA sequences. DNA sequences in the form of 
oligonucleotides can be readily synthesized using various procedures, including 
phosphoramidite synthesis or phosphotriester chemistry, as described, e.g., in 
Oligonucleotide Synthesis: A Practical Approach , Gait, Ed., IRL Press, Oxford, 
England (1984), incorporated by reference herein, or may be obtained already 
^synthesized from a commercial source. Synthesis of oligonucleotides has been 
greatly simplified by automation. Once synthesized, the DNA may be retrieved 
and purified using techniques, individually or in combination, such as gel 



electrophoresis, high performance liquid chromatography or thin layer 
chromatography. A mixed population of oligonucleotides, heterogeneous and 
random in DNA sequence, may also be produced using these methods. Individual 
oligonucleotides of lengths up to several hundred basepairs can then be hybridized 
to one another to construct DNA duplexes for mutagenesis. 

In the present invention, the number of possible replacement DNA 
sequences generated for a given DNA sequence will depend on the number of 
randomly substituted nucleotide bases in the synthetic oligonucleotides 
synthesized. For example, for a population of random DNA oligonucleotides, each 
19 basepairs in length, there exists a maximum of 4^^ (3 x 10-^^) different possible 
replacement sequences (combinations of nucleotide bases). 

Cloning of the random DNA sequences. The random DNA sequences 
synthesized as described above are used to transform prokaryotic or eukaryotic 
cells. The DNA may be introduced using cloning vectors including plasm ids, 
vectors capable of integrating into host ceil chromosomes, for example, 
bacteriophage lambda, other bacteriophages (e.g,, the M13 family of filamentous 
bacteriophage vectors described by Messing (in Methods in Enzymology , 101, 
pp. 20-78, Academic Press (1983)), viruses, or cosmids, or by vectorless gene 
transfer wherein linearized DNA containing the gene of interest is added directly 
to a culture of host cells, resulting in transformation of the cells. Glover, D.M., 
in Gene Cloning: The Mechanics of DNA Manipulation , London, Chapman and Hall 
(1984). Other methods of introducing DNA into a cell may also be used, for 
example, direct injection such as microinjection, or using sperm carriers. 

Insertion of random DNA sequences. Where a vector is used as the 
cloning vehicle, the vector chosen may be altered by deleting a particular region 
of DNA known to contain a functional site, for example, the promoter recognition 
site for a particular gene, using standard methods, such as cleavage by restriction 
enzymes, to insert the synthetic random DNA sequences. The termini of random 
DNA sequences, synthesized as described above, may be modified so as to be 
easily inserted into the vector selected. For example, sequences with ''sticky'^ 
ends generated by cleavage by restriction enzymes may be inserted into a vector 
which has been digested using the same enzymes. The random DNA sequences are 
then inserted into the vector with a ligation enzyme, for example, the enzyme T4 
DNA ligase, using standard techniques such as those described by Maniatis, et aU, 
Molecular Cloning; A Laboratory Manual , Cold Spring Harbor, New York (1982), 
incorporated by reference herein. 



The random DNA sequences may also be directly inserted into a 
vector without removing any DNA, for example, to select for DNA sequences 
encoding a novel protein. Random sequences may thus be introduced into an open 
reading frame (a DNA sequence encoding a protein and containing appropriate 
regulatory elements for translation including a promoter, ribosome binding site, 
translation initiation codon, and stop codon) by ligating the random fragments to 
the appropriate genetic regulatory elements, such as transcription and translation 
initiators, for example the promoter, or Shine-Daigarno ribosome binding site. 

Expression of recombinant DNA. The recombinant DNA vectors 
containing the inserted randomly synthesized DNA sequences are then used to 
transform or transfect suitable host cells such as bacteria. An example of a 
suitable bacterium is colu If E. coll bacteria are used as a host, a preferred 
method for transformation is the high efficiency technique described by Hanahan, 
J. Mol. Biol. , 166, pp. 557-580 (1983), incorporated by reference herein. Host cells 
are then grown in an antibiotic, such as ampicillin, to select for those cells which 
contain recombinant plasmids carrying the cointroduced random DNA sequences. 

The recombinant DNA may then be purified from host cell 
preparations and sequenced using well-known procedures to verify that it contains 
the inserted synthetic DNA sequences, and to reveal the sequence of the inserts- 

Biological selection. The methods of the present invention use 
biological selection of randomly synthesized DNA sequences to identify and 
isolate those sequences that exhibit a desired biological function. To accomplish 
such selection, a procedure for detecting the desired functional activity must be 
chosen. In Examples 1-5 below, the function selected for is the ability to promote 
transcription of the gene for tetracycline resistance. In these examples, the 
antibiotic tetracycline is used to select for those synthetic DNA sequences able to 
function as promoters for transcription of the tetracycline resistance gene, as 
determined by the ability of E. coa* cells carrying plasmids to grow in the presence 
of the antibiotic- 
Novel DNA sequences capable of a different function, for example 
"leader" sequences (also known as "signal sequences"), may also be selected from a 
population of random DNA fragments- Leader sequences allow for the 
transmembrane insertion or extracellular secretion of cellular proteins, and are 
common to both prokaryotes and eukaryotes. Leader sequences from different 
proteins are composed predominately of hydrophobic amino acid residues and 
^range in length from 13 to 36 amino acid residues- Alberts, B.M., et al., in 
Molecular Biology of the Cell , New York, Garland Publishing, Inc., (1983). While a 



consensus leader sequence has been established for several proteins (Kretsinger, 
R. H. <Sc Creutz, C. E., Nature ^ 320 p. 573 (1986)), the leader sequences of most 
proteins differ considerably and contain no homology to each other, (von Heijne, 
G,, J> Mol« BloU , 184 pp, 99-105 (1985)), sugg'esting' that a great variety of protein 
sequences may possess this secretory function. Leader sequences may be 
identified, e*g., by using a gene for a secreted protein, such as 8-lactamase, 
capable of conferring positive growth selection. 6-lactamase is secreted 
extracellularly and protects cells from attack by antibiotics, such as ampicillin 
(Alberts et aK, supra)* The 5^ portion of the 8-lactamase gene of the plasmid 
pBR322 contains 66 nucleotides coding for a 23 amino acid residue leader 
sequence of the protein product. Ambler, R. P. 6c Scott, G- K., Proc. NatU. Acad. 
Sci. (USA) , 75, pp. 3732-3736 (1978); and Sutcliffe, J. G. Proc. Nat'l Acad. Sci. 
(USA) , 75, pp. 3737-3741 (1978). Frameshift mutations, point substitutions, and 
deletion mutations in this region of the gene produce defective proteins that are 
not secreted from the cell. Koshland, D, A Genetic Analysis of Beta-lactamase , 
Ph.D. thesis, Mass. Inst. Technol., Cambridge (1982). Consequently, bacteria 
harboring plasmids with nonfunctional mutations in the leader sequence of the 
B-lactamase gene are sensitive to the antibiotic ampicillin, whereas bacteria 
harboring plasmids with intact 6-lactamase genes are resistant to ampicillin. As a 
result, host cells containing vectors with inserted DNA sequences not capable of 
functioning as 6-lactamase leader sequences will not survive in media containing 
the antibiotic ampicillin. However, ceils which do secrete 6-lactamase and 
survive in media containing the antibiotic will contain vectors having synthetic 
DNA sequences capable of functioning as leader sequences, and these sequences 
may then be readily isolated and sequenced as described above. 

In addition, growth selection using the present invention may be used 
to select DNA sequences capable of encoding novel proteins. Random DNA 
sequences may contain unique sequences that are capable of coding for 
biologically active proteins. Thus, long, random DNA fragments may be ligated 
into a vector such as a plasmid vector containing the regulatory elements for gene 
expression, including DNA encoding promoter and ribosome binding sites. The 
resulting vector will thus be able to transcribe random proteins from the randomly 
generated open reading frame. A heterogeneous population of plasmids, each 
coding for a unique, random protein may be used to transform appropriate 
eukaryotic or prokaryotic host ceils such as E, coli cells. From these ceils, 
proteins possessing novel catalytic activity may be identified using growth 
selection. Alternately, to isolate a novel protein capable of degrading a particular 
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plastic, the transformants may be grown on media, for example, Luria-Bertani 
(LB) agar, covered with the particular plastic as a growth barrier, and then 
layered with more media (e.g., LB agar). Bacterial clones containing random DMA 
sequences conveying plastic-degrading activity will penetrate the plastic barrier 
5 to grow in the topmost layer of agar. An example of the use of this invention to 

select DNA sequences encoding novel proteins from a population of random DNA 
sequences is described below, in Example 7» 

From the foregoing, it can be appreciated that the biological 
selection of the present invention provides a technique for screening large 
10 numbers of random, synthetic DNA sequences to identify those novel sequences 

capable of carrying out the chosen function. The present invention does not 
require characterization (e.g., sequencing, hybridization, and other procedures) of 
the synthesized DNA sequences prior to the selection process, thereby eliminating 
the need for tedious and time-consuming manipulations. 
-15 Other parameters useful for biological selection based on the ability 

of transformed cells to grow in certain media, in addition to antibiotic resistance, 
include resistance to toxins such as metals and environmental pollutants such as 
polychlorinated biphenyl (PCB), and the ability to grow in media containing or 
lacking specific metabolites, for example, in the presence of unusual carbon 
20 sources including oils and environmental pollutants. 

The present invention may also be used to select for novel, synthetic 
DNA sequences capable of encoding a peptide which exhibits the biological 
activity of the peptide encoded by the natural (wild-type) DNA sequences, or for 
entirely pew, functional peptides. Standard assays may be used to detect the 
25 presence or absence of expressed peptide in a growth medium or even within the 

host cells. For example, visually screenabie assays to indicate the presence of a 
peptide using enzyme and enzyme substrate reactions to generate a color signal, 
such as the blue color produced by 8-galactosidase in the presence of sugar 
analogs, may be used. The expressed peptide must then be tested for the ability 
30 to carry out the function of the corresponding wild-type DNA peptide or to 

determine whether a peptide exhibiting the new function has been produced. 

Examples 1 through 5 below, describe representative protocols for 
obtaining novel, synthetic DNA sequences with promoter activity. A comparison 
of known RNA polymerase-binding sites of different genes of E, coli reveals two 
35 ^highly conserved promoter elements centered at about 10 and 35 nucleotide 
basepairs upstream from the start of transcription- Pribnow, J. Mol. Biol., 99, 
• pp. 419-443 (1975); Schaller et aL, P.N.A.S. USA , 72, pp. 737-741 (1975); 
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Takanami, et al.. Nature , 260, pp, 297-302 (1976); Seebur^, et al., Eur. J. 
Biochem., 74, pp. 107-113 (1977); and Rosenberg and Court, Ann. Rev. Genet , 13, 
pp, 319-353 (1979). The involvement of each nucleotide in the initiation of 
transcription has been inferred largely from an analysis of mutations. Rare 
5 mutations that increase transcription (so-called "up" mutations) usually increase 

homology with the consensus sequence and spacing, while the more common 
mutations that decrease transcription ("down" mutations) usually decrease 
homology with the consensus sequence and spacing. McCiure, supra. 
Nevertheless, as demonstrated by the present invention, sequences that greatly 
10 differ from the consensus sequence can still function as promoters. 

The present invention uses random mutagenesis to produce novel, 
synthetic DNA sequences which are capable of exhibiting biological activity, 
although such sequences may substantially deviate from known, wild-type 
sequences having that same function in nature. The process described above and 
15 in the Examples which follow, extends beyond the use of mutagenesis merely as a 

tool to study the nature of regions of DNA encoding specific biological activity. 
The present invention uses significantly expanded random mutagenesis, i.e., 
heterogeneity, at a substantial number of nucleotide bases within a DNA 
sequence, to produce novel, functional sequences. Furthermore, the use of 
20 selection pressure, e.g., binding a preselected antigen, provides an efficient, cost- 

effective and rapid means for screening large numbers of synthetic DNA 
sequences to isolate those novel DNA sequences capable of, e.g., coding for an 
immunological variable region of desired specificity. 

In summary, the invention provides a method of obtaining an 
25 oligonucleotide capable of carrying out a predetermined biological function. 

First, a heterogeneous pool of oligonucleotides is generated. The oligonucleotides 
are typically n nucleotides in length, and the nucleic acid sequences of the 
oligonucleotides in the heterogeneous pool typically represent a random sampling 
of the 4'^ possible sequences for oligonucleotides of that length. A random 
30 sampling of the heterogeneous pool of oligonucleotides is introduced into a 

population of cells that do not exhibit the predetermined biological activity. The 
population of cells is thereafter screened for a subpopulation of engineered cells 
that does exhibit the predetermined biological function. From the subpopulation 
of cells, an oligonucleotide capable of carrying out the predetermined biological 
35 function is isolated. 

^ in the practice of the invention, the randomized oligonucleotide can 

range in length (n) from about 3 or more (or 1 codon or more, if the 
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oligonucleotide is to be expressed) to several hundred or more nucleotides in 
length. It is generally preferable to limit the size of the randomized oligonu- 
cleotide to about 24 bases, coding for 8 amino acids, as this will potentially 
produce a heterogeneous pool of about 8^*^ or a billion different oligonucleotides, 

5 which is convenient for screening. 

The randomized oligonucleotide may alternatively include first and 
second randomized regions (x and z nucleotides in length, respectively) that flank 
on either side a linker region (y nucleotides in length) of preselected sequence- 
The heterogeneous pool of these tripartite oligonucleotides represents a random 

10 sampling of the 4^^^ possible sequences of such oligonucleotides of that length 
(x + y + z). A random sampling of this heterogeneous pool of oligonucleotides is in 
turn introduced into a population of cells that do not exhibit the predetermined 
biological function. The population of cells is thereafter screened for a 
subpopulation of cells exhibiting the predetermined biological function. That is, 

15 the functionality of the tripartite oligonucleotides is screened as a unit. From the 
selected subpopulation of cells, an oligonucleotide containing the linker sequence 
and capable of carrying out the predetermined biological function is isolated. 
Representative protocols are set forth in Example 8 below. The linker sequence 
can correspond with a native sequence or can be a novel sequence that is present 

20 by design in all of the randomized oligonucleotides. For example, a codon for 
serine in a native sequence can be changed in a preselected manner to a codon for 
another nucleophilic amino acid, such as cysteine. 

In this embodiment, the overall length of the randomized portion of 
the oligonucleotide (x + z) is preferably held to no more than about 24 bases. The 

25 individual and relative lengths of the flanking randomized and central conserved 

sequences are not critical and may be dictated by site-specific considerations, 
such as a previously observed association of the linker sequence with the screened 
function. 

As described above, the nucleotide composition of the randomized 
30 oligonucleotide region(s) may alternatively be biased in favor of or away from any 

of adenine, thymidine, cytosine, and/or guanine. Such biasing is conveniently 
accomplished by adjusting the relative concentrations of the dNTPs from which 
the oligonucleotides are randomly synthesized. In other words, the relative 
concentrations of A, T, C, and G need not be about 25% each, e.g., A may 
35 constitute a%; T, t%; C, c%; and G, g%; provided that a-^t^c^g^ 100%. The 

^resulting heterogeneous pool of oligonucleotides, n nucleotides in length, will 
nevertheless have nucleic acid sequences representing a random sampling of the 
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4^^ possible sequences for oligonucleotides of that length synthesized from a pool 
of nucleotides having such preadjusted relative concentrations. 

The following Examples are presented to illustrate the present 
invention and to assist one of ordinary skill in making and using the same. The 
examples are not intended in any way to otherwise limit the scope of the 
disclosure or the protection granted by the Letters Patent hereon. 

EXAMPLE 1 

Production of Novel DNA Sequences With Promoter Activity 
In this Example, novel DNA sequences capable of functioning as 
promoters in the tetracycline resistance gene were produced by: (1) chemical 
synthesis of a heterogeneous population of DNA fragments, 19 basepairs in length, 
with the nucleotide at each position of the 19 basepairs being random; (2) insertion 
of these DNA fragments into pBR322 plasmids with segments of DNA deleted in 
the region of the plasmid known to encode for promoter recognition; 
(3) transformation of £. coli bacteria with the recombinant plasmid; and, (4) 
selection of transformed host cells by growth in media containing the antibiotic 
tetracycline. These steps are summarized in FIGURE 1 and described in detail 
below. 

Restriction enzymes used in this Example were purchased from 
Bethesda Research Laboratories (Gaithersburg, MD), Boehringer Mannheim 
(Indianapolis, IN), and New England Biolabs (Beverly, MA), and, except where 
indicated, use of these enzymes followed the manufacturers^ directions. The 
enzymes T4 DNA ligase, T4 polynucleotide kinase, large fragment of DNA 
polymerase I (Klenow Fragment), and bacterial alkaline phosphatase were 
purchased from Bethesda Research Laboratories. Nick-translation reagents 
(including DNA polymerase I and DNase I) were purchased from New England 
Nuclear (Boston, MA) and used according to the manufacturer's instructions. 

Verification of the promoter recognition sequence in pB R322, Prior 
to synthesis of the random population of DNA sequences, the region of plasmid 
DNA containing the promoter recognition sequence for transcription of the 
tetracycline resistance gene was deleted using restriction enzymes to verify the 
importance of this region for the transcription of that gene. 

The plasmid pBR322 contains a DNA sequence corresponding to the 
tetracycline resistance gene, encoding a single, noninducible 43.5 kilodalton 
polypeptide (Blackman and Boyer, Gene , 26, pp. 197-203 (1983)) that functions at 
jfche cell membrane to block accumulation of the antibiotic tetracycline. McMurry 
et ai., P.N.A.S. USA , 77, pp. 2974-3977 (1980). The transcription initiation site 



for the gene has been identified by Sl-nuclease protection. Brosius et al., J. Biol. 
CheiTU, 257, pp. 9205-9210 (1982). In addition, the position of the promoter 
recognition site for the tetracycline resistance gene has been deduced from: (1) 
promoter consensus sequence homology in pBR322 (Sutcliffe, Cold Spring Harbor 
Symp. Quant. BioL , 43, pp. 77-90 (1979)); (2) deletion mutations (Rodriquez et al., 
Nucl. Acids. Res. , 6, pp. 3267-3287 (1979)); and (3) electron microscopic mapping 
(Stuber and Bujard, P.N.A.S. USA , 78, pp. 167-171 (1981)). 

FIGURE 1 shows the tetracycline resistance gene promoter sequence 
as described by Sutcliffe, supra. Portions of this promoter sequence that 
correspond to the -10 position and -35 promoter recognition site of the consensus 
sequence are shown in boxes. Hawley and McClure, Nucl. Acids Res. , 11, pp. 
2237-2255 (1983). In FIGURE 1 the letter "N" indicates unspecified bases. 
Restriction enzyme cleavage sites for the endonucleases £coRI and Cla I flank the 
promoter recognition site in the tetracycline resistance gene. A plasmid 
designated as pBdEC with a 22 basepair deletion in the promoter recognition site, 
extending from the -43 position (in the EcoRl site) to -21 (in the Cla I site) was 
constructed by digesting pBR322 DNA with EcoRl and Cla I, purifying the larger 
DNA fragment by electrophoresis in low melting temperature agarose (Bethesda 
Research Laboratories), filling in the 5' overhangs with the large fragment of DNA 
polymerase E in the presence of all four deoxynucleoside triphosphates, and 
recircularizing the blunt-ended fragment with T4 DNA ligase, using the 
procedures detailed in Maniatis, supra. The expected deletion was confirmed by 
DNA sequence analysis. 

The recircularized plasmids were used to transform £. coli strain AGl 
(Stratagene Cloning Systems, San Diego, CA), and E. coli found to contain the 
recombinant plasmid were cultured in the presence of both tetracycline and 
ampicillin. The manufacturer's protocol for transformation using these plasmids 
was followed- Briefly, transformed host cells were grown in LB for one hour 
(approximately two generations) prior to antibiotic selection on LB-containing 
agar* Ampicillin concentration in the agar was 50 pg/ml. The transformation 
efficiency was approximately 5 x 10*^ colonies/yg of ligated DNA. The bacteria 
were found to be resistant to a tetracycline concentration of 2 ug/ml in the 
absence of a plasmid, to a concentration of 40 yg/ml when harboring the plasmid 
pBR322, and to only 4 ug/ml when harboring pBdEC (the plasmid with the promoter 
recognition site deleted, as described above). The DNA sequence in which the 
promoter recognition site has been deleted is depicted in FIGURE 3. 



Transcription of the tetracycline resistance gene was identified by 
northern blot analysis using the procedure as described by Maniatis et al., supra* 
Briefly, in this procedure, 5 ug of nucleic acids (DNA and RNA) were purified from 
the E. coli AGl harboring the pBdEC plasmid, using standard procedures, such as 
described by Brosius et al., J> BioL Chem. , 257, pp, 9205-9210 (1982), incorporated 
by reference herein. 114 ng of the extracted pBR322 DNA was electrophoresed on 
1% agarose gel containing 2.2 M formaldehyde. The gel was then transferred to 
hybridization membranes (New England Nuclear) and screened using the 787 
basepair, P-labeled, nick-translated EcoRV to Nru I restriction fragment probe 
from the coding region of the tetracycline resistance gene in pBR322. Filter 
hybridization was performed according to the manufacturer's instructions (New 
England Nuclear) and was conducted at 42°C in 10% dextran sulfate, IM NaCl, 
and 1% NaDodSO^. Filters were washed in 2 x SSC and 1% NaDodSO^ and then 
0.1 X SSC at 25*^0 (1 x SSC is 0.15M NaCl/15 mM sodium citrate, pH 7.0). 
Maniatis et al., supra. The Hind III digest of bacteriophage lambda x DNA was 
used as a size marker. Ethidium bromide staining of the agarose gel in the 
absence of formaldehyde was used to ensure that all lanes contained equal 
amounts of DNA. The autoradiogram (FIGURE 2) was exposed for 8 hours at room 
temperature on XAR film (Eastman Kodak, Rochester, NY). A GS 300 Scanning 
Densitometer (Hoefer Scientific Instruments, San Francisco, CA), which measures 
the degree of exposure of the film proportional to the intensity of the radiation in 
the autoradiogram, was used to quantify hybridization. As can be seen from 
FIGURE 2, deletion of the promoter recognition sequence results in absence of 
transcription of the tetracycline resistance gene. 

Synthesis of oligonucleotides. A heterogeneous population of DNA 

segments which were random in sequence at each of the 19 basepairs thereof was 

synthesized by the phosphoramidite method, using an Applied Biosystems 380A 

DNA synthesizer (Foster City, CA), and purified by thin layer chromatography as 

described by Alvarado-Urbina, et al., in Science , 214, pp. 270-274 (1981), 

incorporated by reference herein. The oligonucleotides were synthesized using 

approximately equimolar mixtures of phosphoramidites at each coupling step. The 

population of recombinant plasmids produced by insertion of these synthetic DNA 

1 9 

sequences therein, as described below, could yield up to a maximum of 4 (or 
approximately 3 x 10^"^) different replacement sequences. 

An 8-basepair (bp)-iong oligomer, 5^-GGATCGAT-3S was hybridized to 
an oligomer 35 bp long, 5'-CCGAATTC(A,C,G,T)^g ATCG ATCC-3% in a 2:1 molar 
ratio, extended with the large fragment of DNA polymerase I, and digested with 
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an excess of the enzymes EcoRI and Taq I. The 8-bp oligomer {10,8 yg) and the 35^ 
bp oligomer (24 ug) were annealed m 90 mM NaCl, 15 miM tris-HCl (pH 7.9), and 
ImM MgCl2 m a total volume of 120 yl by heating at BS^'C for 5 min and then at 
57^G for 90 min, followed by immediate chilling on ice for 15 min. Water and 
other reagents were added to this reaction to bring the final volume to 240 yl 
containing 45 miM NaCl, 3 mM MgCl2j ImM dithiothreitoi, 100 yM of each of the 
four dNTPs and 16 units of the large (Klenow) fragment of DNA polymerase I. 
The reaction was incubated for 1 hour at 25*^0 and then terminated by purification 
on NENSORB columns (New England Nuclear). The eluate was dried under a 
vacuum, resuspended in 150 yl containing 50 mM Tris-HCl (pH 8.0), 10 mM MgCl2> 
50 mM NaCl, and 250 units of the enzyme Taq I. The reaction mixture was 
incubated for 21 hours at 37**C. 5 yl containing 50 units of Taq I were added, and 
incubation continued for an additional 8 hours at 37^C. 55 yl (250 units) of EcoRl 
was then added, and incubation continued for an additional 16 hours at 37 '^C. 
Progress of the digestion was monitored by electrophoresis of the 
dephosphorylated and ^^P-kinased DNA fragments on a 12% polyacrylamide with 

7 M urea sequencing gel, as described by Perbal, in A Practical Guide to Cloning , 
Wiley-Interscience Pub., New York (1984). The restriction fragments were 
purified by phenol extraction followed by ethanoi precipitation. Approximately 

8 yg of double-stranded DNA derived from the synthesized oligonucleotides was 
thereby obtained, a portion of which was inserted into the plasmid pBR322 as 
described below. 

Plasmid construction. Plasmids containing the heterogeneous 
population of synthesized and duplexed oligonucleotides, collectively denoted as 
plasmid "pRAN4," were produced by digesting pBR322 with the enzymes KcoRI 
and Cla I, removing 5' phosphates with bacterial alkaline phosphatase, purifying 
the resulting large DNA fragment by electrophoresis in low melting temperature 
agarose gel, and ligating the appropriate double-stranded DNA restriction 
fragment obtained as described above with the vector DNA in a 20:1 DNA insert 
to vector molar ratio using T4 DNA ligase. Maniatis et al., supra. The sequences 
of the random DNA fragments contained in the plasmids within the pRAN4 
population are indicated in FIGURE 3 (plasmids pBG8, pBB3, pBB5, pBB9, pBBlO, 
and pBB13). 

Transformation. Competent E. coii cells, strain AGl (recAl) 
prepared as described by Hanahan, J. Mol. Biol , 166, pp. 557-580 (1983), 
incorporated by reference herein, were purchased from Stratagene Cloning 
Systems and used for high efficiency transformation. All transformed colonies 



were grown in Luria-Bertani agar containing 0.1% w/v glucose for 60 minutes 

(approximately two generations) prior to antibiotic selection. The transf ormants 

were then placed in LB agar containing 50 ug/ml ampicillin to identify bacteria 

carrying recombinant plasmids with functional random DNA inserts. 

E, coli AGl was transformed using 410 ng of DNA from the pRAN4 

3 4 

plasmid population, obtained as described above, at an efficiency of 10 to 10 
colonies/yg DNA using ampicillin selection and 10 to lO'^ colonies/ug DNA with 
ampicillin and tetracycline selection. Table 1 shows the number of colonies 
produced by transformation using pRAN4 (lA) and two other plasmid populations, 
pRAN3 and pBT9R (described below). 

TABLE 1 

Transformation With Random Plasmid Populations 
amp selection amp and tet sel ection 



Sequences, no. Seque nces, no. 

Colonies, Colonies, 
DNA No. pBR322 Deletions Inserts No. pBR322 Delet ions Inser 

A. pRAN4 125 2 3 5 28 27 0 1 

B. pRAN3 887 0 0 10 23 21 0 2 

C. pBT9R 353 114 0 0 4 

Ampicillin selection yielded 125 colonies. To determine the nature of the DNA 
insertions present, plasmids from 10 of these ampicillin resistant colonies were 
characterized by size fractionation using electrophoresis on 1% agarose slab gels 
and DNA sequence analysis (dideoxy chain termination method). Two plasmids 
were found to be identical to plasmid pBR322, and three plasmids containing 
deletions bounded by the EcoRi and Cla I sites were identified (sequences not 
shown). These plasmids are assumed to be part of a background of about 50% of 
the vectors which either escaped digestion with restriction enzymes or ligation of 
the inserted DNA sequence. The remaining five plasmids, pBB3, pBB5, pBB9, 
pBBlO, and pBB13, were found to contain promoter substitutions of from 10 to 23 
basepairs in length (FIGURE 3). Of a total of 77 bases substituted among these 
five plasmids, the average insert length was 15 basepairs long and the composition 
was 22% adenine, 27% cytosine, 34% guanine and 17% thymine. 
^ Biological selection using tetracycline. Tetracycline resistance was 

used for biological selection for promoter activity in plasmids containing promoter 
substitutions of synthetic DNA sequences. The tetracycline resistance gene of 



pBR322 previously has been used as a promoter probe* Widera, et al., Molec. Gen. 
Genet. , 163, pp. 301-305 (1978); Neve et ai.. Nature , 277, pp. 324-325 (1979); and 
West and Rodriquez, Gene , 7, pp. 291-304 (1982). In these experiments, the 
promoter typically was inactivated by deletion within the recognition site. 
Restriction digests of genomic DNA from prokaryotes and eukaryotes were iigated 
into the inactive promoter region as a test of their potential to restore promoter 
function. Depending upon the organismal source of the DNA, insertions with 
promoter activity were selected at a frequency of 0.2 to 33% from among all 
recombinants. In the absence of DNA sequence data, this high frequency of 
promoter selection was explained as either verification of functional promoters 
(West et al.. Gene , 7, pp. 271-288 (1979)) or as fortuitous restoration of the 
deleted portion of the native promoter. Brosius, Gene , 27, pp. 151-160, (1984). 

Tetracycline resistance of the E. coli containing plasmids with 
inserted synthetic DNA sequences was determined by the efficiency of plating- 
50% method (EOPgQ) described by West et al. in Gene , 7, pp. 271-288 (1979). Test 
colonies were inoculated using a sterile needle on LB agar with 0.1% (w/v) 
glucose, containing 0, 1, 2, 4, 8, 8, 10, 16, 20, 30, 40, 50, 60, 70, 80, 90, and 100 
ug/ml tetracycline. Inhibitory values were defined as those which reduced growth, 
as measured by reduction to one-half the number of viable colonies. 

Ampicillin and tetracycline selection in combination yielded 28 
colonies. To characterize the plasmids from all 28 colonies to determine if any 
contained promoter sequences, plasmids were purified for gel electrophoresis 
sizing and DNA sequencing by lysing the host E. coZi cells by brief treatment with 
lysozyme (Sigma Chemical Co., St. Louis, MO), then detergent and sodium 
hydroxide!. Cell debris was pelleted by centrifugation, and the plasmid was 
purified from the cleared supernatant using ethanol precipitation. Maniatis, et 
al., supra, Plasmid sizes were then compared using agarose gel electrophoresis, 
and about 200 nucleotide basepairs centered at the promoter recognition site were 
sequenced. DNA sequencing was conducted using the dideoxy chain termination 
method (Sanger et al., P.N.A.S. (USA) , 74, pp. 5463-5467 (1977), incorporated by 
reference herein, directly from double-stranded pBR322 templates from these 
rapid preparations. (Wallace et ai.. Gene , 16, pp. 21-26 (1981)). 27 plasmids were 
found to be identical to pBR322. Again, these are from the background of 
unmodified vectors because it is improbable that a promoter sequence identical to 
pBR322 would be present within the small subpopuiation of all random sequences 
selected here- The other plasmid, pBG8, contained a 38 basepair promoter 
substitution (FIGURE 3). Although there was sequence length heterogeneity, the 
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average length of all the pRAN4 substitutions including those not selected for 
tetracycline was 19 bases, in agreement with the length of the synthetic DNA 
sequences. Therefore, approximately half of the 125 ampicillin resistant colonies 
contained plasmids with promoter substitutions and about 1 out of 63 colonies was 
also tetracycline resistant. These results suggest that approximately 2% of the 
3 X 10^-^ possible random sequences may duplicate promoter recognition site 
activity. 

EXAMPLE 2 

Production of Novel DNA Sequences Lacking Adenine with Promoter Activity 

To verify that the synthetic restriction DNA fragments were 
responsible for the observed sequence heterogeneity at the promoter recognition 
site, and to check for allowable sequence diversity, a population of plasmids, 
"pRAN3," was prepared so as to be deficient in the nucleotide base adenine in the 
sense strand throughout the randomly substituted sequences of DNA. Thus, 
oligonucleotides were synthesized using an 8-bp oligomer, 5'-GG ATCGAT-3% 
annealed to a 35-bp oligomer, 5'-CCG A ATTC(C,G,T)i9 ATCGATCC-3' oligomer, 
using the procedures described in Example 1 for synthesis of oligonucleotides. The 
random plasmid population pRAN3 was produced as described above for pRAN4 in 
Example I, by digesting pBR322 with Eco RI and Cla I, removing 5^ phosphates 
with bacterial alkaline phosphatase, purifying the larger fragment by 
electrophoresis, and ligating these randomly synthesized DNA fragments deficient 
in adenine using T4 DNA ligase. Plasmid population pRAN3 contains the sense- 
strand promoter recognition site substitutions of random 19 base sequences of 
cytosine, guanine, and thymine. For this population, there is a maximum of 3 
(or approximately 10^) different, possible replacement sequences, E. coli bacteria 
strain AGl was transformed using 410 ng of pRAN3, as described in Example 1 for 
transformation using pRAN4 DNA, at an efficiency of 10^ to 10^ colonies/yg DNA 
using ampicillin selection and 10 to 10^ colonies/ug DNA with ampicillin and 

tetracycline selection. 

The results of transformation using pRAN3 plasmids are shown above 
in Table IB. Equal amounts of DNA were used for each antibiotic selection. 
Ampicillin selection yielded 887 colonies. Plasmids from 10 of these ampicillin 
resistant colonies were characterized. All 10 of these plasmids, pBAl through 
pBAlO, contain replacement insertions of from 15 to 29 nucleotide basepairs 
^(FIGURE 3), implying that greater than 90% of the plasmids in pRAN3 contain 
promoter substitutions. Of the total of 203 bases substituted among these 10 
plasmids the average insert length was 20, and the composition was 0% adenine. 



32% cytosine, 42% guanine, and 26% thymine* Because the sequence of the 
inserted DN'A determines the final sequence composition of the plasmid, these 
results indicate that the sequence heterogeneity results from ligation of the 
random DNA insert and not from a cellular process randomly mutating nucleotides 
within a sequence. 

Ampicillin and tetracycline selection yielded 23 colonies. Plasmids 
from all 23 of these ampicillin/tetracyciine resistant colonies were characterized 
using plasmid purification and sequencing procedures as described in Example 1. 
21 plasmids were identical to pBR322. The inclusion of adenine in these promoter 
sequences indicates that these are from the background of unmodified vectors and 
are not present within the small subpopulation of all random sequences selected 
here. The other two plasmids, pBT9 and pBT21, contain promoter substitutions of 
19 and 17 nucleotide basepairs, respectively, lacking adenine (FIGURE 3). An 
additional tetracycline resistant colony containing a plasmid, pBTR3 (FIGURE 3), 
was detected using replica plating of the colony growing on the ampicillin media 
onto ampicillin/tetracyciine media. In this technique, bacteria colonies are 
removed from one plate and placed in the same spatial orientation onto a new 
plate containing ampicillin and tetracycline (data not shown). Although there was 
sequence length heterogeneity, the average length of all pRAN3 substitutions, 
including those not selected for tetracycline, was 20 bases, close to the 19 
basepairs of the synthetic DNA sequences used for mutagenesis. Therefore, 
greater than 90% of the 887 ampicillin resistant colonies contained promoter 
substitutions and about 2 of these colonies were also tetracycline resistant, 
suggesting that approximately 0.2% of the 10^ possible random sequences present 
in this construction may duplicate promoter recognition site activity. As 
expected, the absence of adenine reduced the frequency at which promoter 
recognition sites were selected from random sequences. 

EXAMPLE 3 
Correlation Between Tetracycline Resistance 
and Transcription of Tetracycline Resistance DNA 

In order to investigate the correlation of the tetracycline resistance 
phenotype with transcription of the tetracycline resistance gene, E. coli strain 
DH5.1 cells bearing plasmids obtained in Examples 1 and 2 and containing 
promoter recognition site replacements were tested for resistance to tetracycline 
fright-hand column of FIGURE 3). The range of tetracycline resistance conferred 
by the plasmids selected for ampicillin resistance only (pBB3, pBB5, pBB9, pBBlO, 
pBB13, and pBAl through pBAlO), was between 2 and 10 ug/mi. For the 
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ampioillin/tetracycline selected plasmids (pBG8, pBT9, pBT21, and pBTR3), the 
range was between 30 and 60 ug/ml, while pBR322 was resistant to 40 pg/ml. Both 
pBT9 and pBT21 were found to be resistant to concentrations that inhibited 
pBR322, i.e., 50 and 60 yg/ml, respectively. Tetracycline resistance was measured 
using the EOP^q method as described above in Example 1. 

To show the direct correlation between levels of tetracycline 
resistance and the presence of tetracycline resistance transcript, a northern blot 
analysis was performed, as described above, and used to quantify transcription 
from the tetracycline resistance gene (FIGURE 2). Cellular nucleic acids (both 
DNA and RNA) were probed with the 787 basepair EcoR V to Nru I restriction 
fragment from the protein coding region for tetracycline resistance described 
above in Example 1, The absence of hybridization in strain DH5.1 lacking plasmid 
reveals the specificity of this probe. Hybridization of nucleic acids from strain 
DH5,1 E. coli harboring pBR322 detected three bands. The two highest in 
15 molecular weight were plasmid DNA, while the third band (lowest molecular 

weight) contained the transcript from the tetracycline resistance gene of about 
1.4 kb maximum length- DH5.1 containing either the tetracycline sensitive 
promoter deletion plasmid, pBdEC (Example 1), or the tetracycline sensitive 
promoter substitution plasmid, pBA5 (Example 2, pRAN3), revealed an absence of 
transcription from the tetracycline resistance gene. DH5.1 harboring the 
tetracycline resistance promoter substitutions, pBG8 (from pRAN4 population) and 
pBT9, pBT21, and pBTR3 (from pRAN3 population), exhibited a varying level of 
tetracycline resistance transcript. The amount of tetracycline resistance 
transcript for each plasmid was quantified using densitometry of the northern blot 
obtained. Transcript levels were normalized to plasmid DNA concentrations by 
taking the quotient of the values for the 1.4 kb band and the plasmid DNA bands. 
Table 2 shows the correlation of phenotype with transcription. 
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TABLE 2 

Correlation of Phenotype and Transcription 
Plasmid PBT21 pBT9 pBG8 pBR322 pBTR3 pBA5 pBdEC 



tetracycline i 25 1 00 1 0.75 0,05 0.05 

resistance l«i> i.i&o x.uu x 



tet^ 



^transcript 
35 f 



1.27 1.17 1.01 1 0.290 0 0 



All values are expressed relative to those obtained in pBR322, Thus, 
there was a direct correlation between the levels of tetracycline resistance and 
the tetracycline resistance transcript* Therefore, the tetracycline resistant 
phenotype provides a good estimate of promoter strength in these Examples, 

EXAMPLE 4 

Demonstration that Tetracycline Resistance is not Produced by Host Cell Mutation 

To verify that the promoter substitution was in fact the cause of the 
tetracycline resistant phenotype, and that a plasmid mutation rather than a 
chromosomal mutation was responsible for the phenotype, secondary 
transformations using plasmids purified from the primary ampicillin 
resistant/tetracycline resistant transformants were performed. Plasmids pBG8, 
pBT9, pBT21, and pBTR3 (FIGURE 3) were used to transform E. coli strain DH5 
(obtained from Bethesda Research Laboratories) using tetracycline selections with 
an efficiency approximately equal to that of pBR322. Therefore, tetracycline 
resistance was conferred by mutation of the promoter region on the plasmid, and 
not by mutation within the host bacteria. 

Discussion 

The above Examples demonstrate the usefulness of the present 
invention for producing novel DNA sequences capable of promoter activity by 
biological selection of a population of synthetic DNA fragments heterogeneous in 
sequence. Deletion of 19 basepairs at the -35 promoter region of the tetracycline 
resistance gene of pBR322 abolished transcription from the tetracycline 
resistance gene of the plasmid. Substitution of these 19 basepairs with random 
DNA sequences resulted in a maximum of 4^^ (3 x IQ-^-*^) possible replacement 
sequences. In a population of about 1000 bacteria that were found to harbor 
plasmids with these random substitutions, tetracycline selection identified several 
functional -35 promoter sequences. These novel promoters exhibited only partial 
homology to the -35 promoter consensus sequence. Further, two of the sequences 
(plasmids pBT9 and pBT21, FIGURE 3) were found to promote transcription more 
strongly than the native promoter. These promoter sequences differ from all 
previously identified promoters and promoter mutations. 

In the tetracycline resistance gene of pBG8, pBT9, and pBT21, in -^^ivo 
transcription initiates ten nucleotides downstream from the pBR322 initiation 
site, as determined from mapping the 5' end of the tetracycline resistance mRNA 
fusing primer extension as described by McKnight et al.. Cell , 25, pp. 385-398 
(1981). Briefly, an oligonucleotide primer was hybridized to a sequence within the 
tetracycline resistance gene. Synthesis of a complementary strand by reverse 
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transcription produced a DNA sequence whose length is indicative of the distance 
between the 5' mRNA end and the location of the known primer. 

While all four of the promoter recog^nition regions obtained in the 
above Examples retain homology with the consensus sequence, in three of those 
sequences, pBG8, pBT9, and pBT21, the consensus alignment is shifted ten 
nucleotides downstream (FIGURE 3). In these three promoters, in vivo 
transcription initiation also shifts ten nucleotides downstream. The RNA 
polymerase therefore recognizes a new Pribnow box from within the original 
pBR322 sequence. A candidate Pribnow box, ten nucleotides downstream from the 
original Pribnow box, is TGCGGTAGTTT, wherein agreement with the consensus 
sequence has been underlined. Supporting evidence comes from a mutation in the 
lac promoter in which a base substitution at position +1 (downstream) activates a 
latent Pribnow box to initiate transcription at position +13. Maquat et ai., J. Mol. 
Biol., 139, pp. 551-556 (1980). 

The functional tetracycline resistance promoter in pBTR3 has 
conserved the consensus spacing of 17 nucleotides between the position -35 and 
-10 promoter elements. However, the nonfunctional tetracycline resistance 
promoter in pBA7 retains substantial homology with the consensus sequence, but 
not the spacing. These promoter mutations demonstrate the significance of the 
spacing between the two promoter elements. 

From among the more than 150 known promoters and promoter 
mutations (McClure, Ann. Rev. Biochem. , 54, pp. 171-204 (1985)), the only "up" 
mutation that decreases homology to the consensus sequence is found in -35 region 
of the arg promoter (Horwitz et al., J. Bacteriol. , 142, pp. 659-667 (1980)), and 
there are no "down" mutations that increase homology with the consensus 
sequence. Among the sequences described herein, pBTR3 contains decreased 
homology with the consensus sequence, relative to pBR322, and is a down 
mutation. However, pBT9 and pBT21 contain tetracycline resistance promoters 
with decreased consensus homology at both promoter elements, yet are both up 
mutations. While these results highlight the importance of the promoter 
consensus sequence, they also surprisingly indicate that deviations from the 
consensus need not necessarily decrease promoter activity. 

EXAMPLE 5 

Demonstration that Synthetic DNA Sequences Function Within the Plasmld to 
f Produce Tetracycline Resistance 

In order to unambiguously establish that the promoter replacements 
are the elements of the plasmid responsible for tetracycline resistance, one of the 
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plasmids (pBT9) was reconstructed. Oligonucleotides of known sequence were 
used to duplicate the promoter substitutions (FIGURE 3) found in pBT9. Two 
oligonucleotides, each identical to one of the two DNA strands of the promoter 
recognition region in this plasmid were chemically synthesized. 

5 Construction of pBT9R. A 26-bp oligomer, 5*-AATTCTTGGGCGC- 

GCGTCGGCTTGAT-3* (17 ug), was annealed to a 24-bp oligomer, 5*- 
CGATCAAGCCGACGCGCGCCCA-AG-3' (16 ug), in a 1:1 molar ratio in a reaction 
mixture containing 90 mM NaCl, 15 mM tris-HCl (pH 7.9), and 1 mM MgCl2 to 
form a total volume of 200 ul- The mixture was heated at 65°C for 5 min and then 

10 at 57**C for 90 min, followed by immediate chilling on ice for 15 min Incubation 

with T4 polynucleotide kinase in the presence to ATP was used to add 5' 
phosphoryl termini. Maniatis et al., supra. The resulting product contained EcoRI 
and Cla I cohesive ends. The synthetic 26-oligomer was of the same sequence as 
pBT9 in the nontemplate strand extending from the EcoRl restriction site at 

15 position -47 to the Cla I restriction site at position -21. The complementary 

synthetic 24-oligomer had a sequence equivalent to that of the template strand of 
pBT9 (position -43 to position -18). The annealed synthetic oligonucleotides were 
then ligated into the KcoRI and Cla I-digested pBR322. The resulting construct 
was designated plasmid pBT9R. 

20 E- coli strain DH5 was transformed using 50 ng of pBT9R DNA to 

produce 353 ampicillin resistant and 114 ampicillin resistant/tetracycline resistant 
colonies, at an efficiency of 7 x 10^ coionies/yg DNA using ampicillin selection, 
and 2 x 10*^ colonies/ug using ampicillin and tetracycline selection (Table IC). 
Plasmids from 4 of the ampicillin resistance/tetracycline resistance colonies were 

25 characterized, as described above, by plasmid purification and DNA sequencing. 

Plasmid sizes were compared using agarose gel electrophoresis, and approximately 
200 basepairs centered at about position -3 5 were sequenced. The results are 
shown in FIGURE 3. Plasmids from ail four colonies contained insertions in the 
expected region. Three colonies contained a plasmid, pBT9R, identical to that of 

30 pBT9, while the other colony contained a plasmid that included the sequence 

present in pBT9 but with an extra insertion of three basepairs upstream. In the 
absence of plasmid, DH5 was found to be resistant to a tetracycline concentration 
of 2 yg/ml. When harboring pBT9R, the reconstruction of pBT9, DH5 was resistant 
to a tetracycline concentration of 50 ug/ml, the same level of resistance which the 

35 ^original pBT9 conferred on DH5.1. Therefore, tetracycline resistance was 
conferred by the promoter substitutions and not by a mutation elsewhere on the 
plasmid. 
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Discussion 

The results of the above Examples are summarized in FIGURE 3. The 
consensus promoter sequence as previously identified by Hawley and MeCiure, 
NucL Acids Res. , 11, pp. 2237-2255 (1983), is shown on the top line of FIGURE 3. 
The bases found to be most strongly conserved in the consensus are shown in upper 
case. The sequence of pBR322 is depicted next, below the consensus sequence. 
Following pBR322 are the substitutions for the promoter sequence obtained by 
insercion of the synthetic DNA sequences as described in preceding Examples. 
Spaces have been inserted before and after the pBR322 promoter sequence and the 
promoter substitutions to maximize alignment with the consensus sequence and 
facilitate visual comparison. Portions of sequences exhibiting homology with the 
consensus sequence are enclosed in boxes. Dashes indicate positions in the 
sequence identical to that in pBR322. Bases within the dashes which match the 
pBR322 indicate positions within pBR322, outside of the substitution obtained 
using the synthetic DNA sequences, that, together with the substitution, provide 
homology with the promoter consensus sequence (-3 5). Two plasmids, pBTR3 and 
pBA8, possess downstream insertions, a possible artifact from plasmid 
construction, indicated as subscripts in the dashes (e.g., TCC in pBTR3). Three 
plasmids, pBA7, pBT9, and pBT21, have two sites of potential alignment with the 
consensus sequence; although not aligned, bases in the alternate site are 
underlined in the FIGURE. Tetracycline resistance was determined as described 
above. 

EXAMi?LE 6 
Selection of Novel Leader Sequences 

Plasmid construction. A piasmid designated pBdBLA is constructed 
containing DNA encoding the gene for e-lactamase, lacking the coding region for 
the leader sequence for this protein, and containing a restriction endonuclease 
recognition site at the Immediate 5* terminus of the coding region, so that 
synthetic, random DNA sequences may be conveniently ligated in place. 

The parent plasmid pBdBLA, into which the random DNA sequences 
will be inserted, is constructed using techniques of M13 site directed mutagenesis 
similar to that described by D. M. Glover, supra. The modification of the pBR322 
6-lactamase gene is more easily accomplished if the gene is temporarily 
transferred to a bacteriophage M13 vector. The M13mp family of bacteriophage 
^vectors facilitates site-directed mutagenesis because they can be conveniently 
isolated in single-stranded form. The EcoRI to Pst I restriction fragment of 
pBR322 is cloned into M13mplO. The 5^ portion of the 3-iactamase gene is 
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oontained on the EcoRl to Pst I restriction fragment of pBR322. MlSmplO is 
particularly useful because it contains unique EcoRI and Pst I restriction sites. 
Messing-, J., Methods in Enzymology , 101, p. 20 (1983). 

Restriction endonucleases and enzymes of nucleic acid metabolism 
5 are purchased from commercial sources and used according to the manufacturers' 

instructions. In addition to the well-known protocols for M13 site-directed 
mutagenesis, standard techniques of molecular cloning are used. The plasmid 
pBR322 is digested with EcoRI and Pst I, and this restriction fragment is purified 
by electrophoresis in low melting temperature agarose. The restriction fragment 
10 is ligated into similarly digested M13mpl0 vector. 

Two mutagenic steps are required. The first produces a deletion of 
the existing leader sequence. The second produces an insertion of a unique 
restriction site so that the random DNA sequences may be ligated in place. 

In the first step, the distal 66 bases of the 69 base DNA sequence 
15 coding for the leader polypeptide are deleted. The first three bases, functioning 

as an ATG initiation codon, are preserved. The mutagenic 20-mer oligonucleotide, 
5»-XTTCTGGGTGCATACTCTTC-3% is hybridized to the s-lactamase gene in the 
single-stranded M13mpl0 recombinant plasmid, so as to abridge (i.e., remove or 
"loop-out") the leader sequence. The mutagenic 10-mer primer consists of two 10- 
base-long "arms" which flank either side of the leader sequence. In -^itro copying 
using the large (Klenow) fragment of DNA polymerase I produces a partially 
duplex plasmid in which one strand contains a leader sequence deletion. Complete 
duplication of the mutant DNA strand is accomplished in vivo. The resulting 
deletion is verified using standard DNA sequence analysis, as described above in 
25 Example 1. 

In the second step, a mutagenic primer that is a derivative of the first 
primer is used to insert a unique Bgl II restriction site at the 5' terminus of the 
leaderless, S-lactamase gene. pBR322 lacks a Bgl II site. The mutagenic 26-mer, 
5'TTTCTGGGTGAGACCTCATACTCTTC-3% is hybridized to the Leaderless S- 
lactamase gene in the single-stranded M13mpl0 recombinant plasmid, so as to 
insert a six- base Bgl II restriction site (5*-AGATCT-3') immediately 3' to the 
remaining ATG initiation codon. As above, both in vitro and then in vivo 
polymerization are used to complete the synthesis of the mutant DNA strand. The 
resulting insertion is verified by DNA sequence analysis, as above. 

I Finally, the mutagenized s-lactamase gene fragment is recovered 

from M13mpl0 and reinserted into pBR322 to complete the construction of 
pBdBLA. The ml3mpl0 recombinant plasmid is digested with EcoRl and Pst I, and 
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the restriction fragment is purified by agarose gel electrophoresis. The 
restriction fragment is then iigated into similarly digested pBR322, The resulting 
product, containing a leaderiess B-iactamase gene (with a unique restriction 
enzyme adjacent to the ATG initiation codon), is verified using DNA sequence 
analysis. 

Production of random DNA sequences. Random DNA fragments 
containing Bgl II compatible ends may be Iigated into pBdBLA to select for leader 
function. To produce these random DNA fragments a 74-mer oligonucleotide, 5- 
GGGGAGATCT(A,C,G,T)54AGA-TCTGGGG-3% is hybridized to a complementary 
lO^mer oligonucleotide, 5'-CC-CCAGATCT-3'. In a manner equivalent to that set 
forth above, in Examples 1 and 2, the partial duplex is copied with the large 
Klenow fragment of DNA polymerase I and digested using Bgl II. The restriction 
fragments are Iigated into pBdBLA that has been cleaved with Bgl II and treated 
with bacterial alkaline phosphatase to produce a population of plasmids containing 
heterogeneous random DNA sequences inserted in place of the deleted native 
leader sequence. 

The random stretch of 54 bases, coding for 18 amino acid residues, 

any of which may be one of the 20 amino acids used in protein synthesis, will allow 

for a maximum of 20-^^ (about 3 x 10^^) different possible sequences. (Four more 

amino acid residues, an argenine and a serine residue at both sides of these 18 

residues, will be coded for by the Bgl II recognition site). Out of every 64 codons, 

1 8 

61/64 will not be termination codons; thus, there will be a total of (61/64) x 
20-^^ (approximately 10^^) random sequences that will not contain termination 
codons and will allow for an open reading frame. The fraction of random 
sequences containing open reading frames, and the residue composition (i.e., 
relative hydrophobicity or hydrophilicity) may be altered by biasing the DNA base 
composition in the random sequence. For example, because all termination codons 
contain adenine, a deficiency of adenine in the random stretch will greatly reduce 
the frequency of termination codons. Because the random insertions are of length 
equal to the native sequence, frameshifts are not expected. 

To select those DNA sequences with secretory activity, the random 
plasmid population is used to transform E. coli using selection with ampicillin and 
tetracycline as described above in Example 1. Tetracycline will select for 
bacteria containing plasmids bearing inserts (as the pBR322 tetracycline 
^resistance gene is left intact in pBdBLA), and ampicillin will select for functional 
leader sequences. The random plasmid population transforms £. coii strain DH5 at 
high efficiency (Hanahan, supra,). The tetracycline concentration in the media is 
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12.5 pg/ml, and ampiciilin concentration is 50 ug/ml. Transformants are amplified 
by growth in LB at ST^'C for 1 hour (approximately two generations) prior to 
antibiotic selection on LB agar. Because pBdBLA contains a deletion of the leader 
sequence, pBdBLA will be ampiciilin sensitive. Therefore, ampiciilin resistant 
5 transformants will contain functional leader sequences. These novel, functional 

leader sequences may be further characterized by DNA sequence analysis. All 
resulting sequences will be flanked by Bgl II restriction sites, facilitating their 
transfer to other genes and/or plasmids, for further analysis or subsequent use. 

The above procedure allows for the direct selection, rather than the 
10 tedious screening, of functional leader sequences. Leader sequences are thus 

selected from a population of synthetic random DNA, rather than from organismal 
DNA of known or putative secretory activity. Because novel leader sequences 
may be identified, useful and unique properties may be selected. For example, 
this technique may be used to create leader sequences capable of functioning in a 
15 wide variety of different organisms, or leader sequences that retain activity in 

inhospitable physical environments. 

EXAMPLE 7 

Selection of Novel Proteins 
Plasmid Vector. The plasmid ptac-85 has been engineered for the 

20 expression of gene fragments lacking a promoter, a ribosome-binding site, and an 

initiation codon. Marsh, P., Nucleic Acids Research , 14, p. 3603 (1986), 
incorporated by reference herein. Another useful plasmid is pKK 233-2, available 
from Pharmacia Fine Chemicals (Piscataway, NJ). Insertion of DNA at the BamH 
I site of this plasmid will produce an open reading frame, from which the random 

25 DNA fragments may be translated into random proteins. Recombinant ptac-85, 

containing inserted random DNA produced as described below, is used to 
transform E. coli strain DH5 at high efficiency. Hanahan, supra. Because ptac-85 
contains an ampiciilin resistance gene, growth in ampiciilin will select for 
transformants. 

3Q Production of random DNA sequences. Long, random DNA sequences 

are produced using the enzyme terminal deoxynucieotidyl transferase (hereafter 
"TdT") or by chemical synthesis. In either case, standard techniques of molecular 
cloning are employed throughout. Maniatis et ai., supra. All reagents and 
enzymes, including TdT, are available from Pharmacia, as well as from other 

35 ^suppliers. 

TdT will catalyze the random addition of deoxynucleotide triphos- 
phates to the 3' termini of DNA, and has previously been used to produce random 
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protein coding DNA sequences of 400 nucleotides in length. Damiani, G., et al.. 

Nucleic Acids Research , 10, pp. 6401-6410 (1982), incorporated by reference 

herein. Briefly, this protocol consists of the following: (i) randomly tailing with 

TdT a short, single-stranded random oligomer to produce a single-stranded DN"A of 

about 900 nucleotides in length; (ii) making this single-stranded DNA double 

stranded by hybridizing it to the same short, single-stranded random oligomer and 

copying it with the large fragment of DNA polymerase I; (iii) blunt-end digesting 

the resulting double-stranded DNA using the single-strand-specific enzyme SI 

nuclease; and (iv) ligating BamH I linkers onto the final, random, double-stranded 

DNA in preparation for cloning into the parent vector. Single-stranded DNA of 

5 nucleotides in length (p(dN)g), from a limit digest of calf-thymus DNA 

(Pharmacia, Piscataway, NJ), is randomly tailed by TdT. A typical 100 ul reaction 

containing 50 ng p(dN)g, 10 units TdT, 100 mM potassium cacodylate (pH 7.0), 

1 mM C0CI2, 0.2 mM dithiothreitol, and 0.25 mM of each of the deoxynucleotide 

triphosphates (dNTPs) is incubated for 1 hour at 37°C- This reaction produces 

approximately 9 yg of random DNA sequences of a mean length of approximately 

900 nucleotides. The resulting product is purified by phenol/chloroform extraction 

and ethanol precipitation. The length of the resulting product may be verified 

3 2 

using electrophoresis on a 1.5% agarose gel, or, if a label such as a- phosphorus 
is incorporated into the dNTPs, both the length and base composition of the 
resulting product may be determined by precipitation in 10% trichloroacetic 

acid. Maniatis et al., supra. 

The base composition of the final product may be biased by altering 
the dNTP concentrations in the initial reaction. For example, to reduce the 
frequency of termination codons appearing in the random DNA sequences, the 
concentration of dATP may be reduced, because ail three of the termination 
codons contain adenine. To make the DNA from the above reaction double- 
stranded, it is hybridized with a five-fold molar excess of p(dN)g in 90 mM NaCl, 
15 mM tris-HCl (pH 7.9), and 1 mM MgCl2 in a total volume of 120 yl by heating at 
65*^0 for 5 min and then 57°C for 90 min, followed by immediate chilling on ice 
for 15 min The concentration of this mixture is then adjusted to 2 mM MgClg* 1 
mM dithiothreitol, 100 uM of each of the dNTPs, and 25 units of the large (Klenow) 
fragment of DNA polymerase [ in a final volume of 240 ul and incubated at 37**C 
for 60 min The approximately 18 ug of double-stranded DNA produced from the 
above reaction is purified by phenol/chloroform extraction and ethanol 
recipitation. Next, the DNA is digested with the single-strand-specific SI 
nuclease to ensure that it is blunt ended. The above DNA is reacted with 100 



units of SI nuclease In 200 ul containing 30 mM sodium acetate (pH 4,6), 50 mM 
NaCU 1 mM ZnCl2, and 5% glycerol and incubated at 37'' C for 30 min The 
resulting blunt-ended, double-stranded DNA (approximately 15 ug) is purified by 
phenol/chloroform extraction and ethanoi precipitation. To prepare the blunt- 
ended, double-stranded random DNA for insertion into the cloning vector, this 
purified DNA is ligated to BamH. I linkers (Pharmacia, Piscataway, NJ) and 
digested with BamH I, both done using standard techniques, Maniatis et al,, supra 

The random DNA is next ligated into the parent vector, using 
standard techniques. 

Growth Selection. The novel catalytic activity is identified from the 
above transformants by growth in a selective medium. To isolate DNA sequences 
encoding a novel protein capable -of conferring tetracycline resistance, the above 
transformants are grown on LB agar containing 12.5 ug/ml tetracycline. 

One construction of a random plasmid population may suffice to 
select for a plurality of novel protein activities. For example, to create each new 
catalytic activity, only the final selection phase need be altered. Additionally, 
since each construction of a random plasmid population will contain a small subset 
of all possible protein sequences, for any given catalytic activity, each random 
plasmid population will yield a different protein sequence. 

An alternative application of the above procedure is its use to modify 
only the active site of a known protein. In an entire protein DNA sequence, only a 
few amino acid residues (i.e., those encoding the "active site") are actually 
involved in enzymatic catalysis. By mutagenizing DNA encoding a gene so as to 
create a heterogeneous population of modified genes, each different and random 
in sequence at the active site, DNA sequences encoding new or modified catalytic 
activities may be selected. 

Although targeted mutagenesis has been used by others to define the 
location of functional sequences within prokaryotic and eukaryotic DNA, the base 
changes from the wild-type sequence were limited in number, presumably because 
of the premise that most functional sequences resemble those already known. In 
these methods, new, functional DNA sequences were not produced. In contrast, 
the method of the present invention applies biological selection to an 
exceptionally large and diverse population of random DNA sequences, and enables 
rapid screening of large amounts of synthetic DNA to identify novel functional 
^DNA sequences. The present method may be used to select novel sequences from 
large populations of possible DNA sequences, rapidly and with relatively little 
effort, since transformation and selection may be accomplished without requiring 
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prior characterization of the synthetic DNA sequences inserted. Because this 
process allows for direct selection from larg'e, random populations of DNA 
sequences, prior knowledge of structure-function relationships with respect to 
specific reg'ions of DNA is not required to g'enerate novel, functional sequences. 
However, the process of this invention may be used to provide information as to 
structure and function relationships of specific regions of DNA encoding biological 
activity. This invention may be used for the selection of DNA sequences encoding 
peptide hormones, and related effector molecule.^ such as growth factors, leader 
sequences for secreted peptides, catalytic domains, genetic markers, and even 
entire enzymes and other proteins in prokaryotes or eukaryotes. Genetic 
regulatory elements, including promoters, enhancers, origins of replication, 
transcription terminators, centromeres, and telomeres may also be produced or 
evaluated by the method of this invention. 

EXAMPLE 8 

Mutants Generated by the Insertion of Random Oligonucleotides 
into the Active Site of the B-Iactamase Gene 

We have remodeled the gene coding for B-lactamase by replacing DNA at the 
active site with random nucleotide sequenc^es. The oligonucleotide replacement 
• (Pt t e^^ ^ X X Ser'^ -g---j^-X~LysI3>^^j^se^^^^ co'^orf'feV the\^\ive^^br1ne-70 but 
also contains 15 base pairs of chemically synthesized random sequences that code 
for 2.5 X 10^ amino acid substitutions (x). From a population of E. coli infected 
with plasmids containing these random inserts, we have selected seven new active 
site mutants that render E. coli resistant to carbenicillin and a series of related 
analogs. . Each of the new mutants contains multiple nucleotide substitutions that 
code for different amino acids surrounding serine-70. Each of the mutants 
exhibits a temperature sensitive B-lactamase activity. This technique thus 
permits the construction of alternative active sites in enzymes based on biological 
selection for functional variants. 

For studies on the active sites of enzymes we chose to insert random 

nucleotide sequences into the gene for RTEM-1 6-lactamase (EC 3*5.2.6) that is 

present in the plasmid pBR322 (Mathew & Hedges, 1976; see the citations listed 

below). Bacterial s-lactamases hydrolyze the S-lactam ring of penicillin or 

cephalosporin transforming them into reactive metabolites. The mechanism of 

catalysis by the class A enzymes involve a transient acyiation of the serine 

^residue at position 70 (Ambler, 1980). The nucleotide sequence and three 
f 

dimensional structure of several class A B-lactamases have been determined 
(Herzberg & Moult, 1987) and indicate a high level of conservation of amino acids 
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surrounding the active Ser-70» Schultz and Richards (1986) used site saturation 
mutagenesis to show that even though Thr-71 is conserved, it can be replaced by 
14 of the 20 amino acids substituted. Thus, despite the evolutionary conservation 
of one amino acid within the active site there could be a high degree of tolerance 
for substitutions, 6-lactams and cephalosporins are among the most frequently 
prescribed class of pharmaceuticals worldwide, and the rapid evolution of 
0-lactamases in pathogenic bacteria continues to defeat the best efforts of 
chemists to create new resistant analogs (Bush, 1988), 

We have replaced a portion of the active site of S-lactamase in the plasmid 
pBR322 with an oligonucleotide that retains the codon for the active Ser-70 but 
also contains two flanking sequences of six and nine random nucleotides. In these 
experiments we have screened 2 X 10^ tetracycline-resistant (tet^) colonies of 
E, coli infected with plasmids containing random inserts and obtained seven new 
carbenicillin resistant mutants. DNA sequence analyses of the mutants indicate 
that multiple amino acid substitutions within the active site can be tolerated and 
are compatible with enzymatic activity. Furthermore, nucleotide substitutions 
involving evolutionarily conserved amino acids alter substrate specificity and 
temperature stability. 

Experimental Procedures 
Materials. Oligonucleotides were synthesized using phosphoramidite 
chemistry by Operon Technologies (San Pablo, California). The following two 
oligonucleotides, each 47 nucleotides in length, were used as templates for the 
construction of the random inserts: 

i) 5»-CGCCCCGAGGAACGTNNNNNNNNNNNNNNNNNNNNNNNAGTACTGCT-3*; 

ii) 5*-CGCCCCGAGGAACGTTTTNNNNNNNNNAGCNNNNNNAAAGTACTGCT-3. 
The stretches of random nucleotides (designated by the Ns) within these inserts 
were synthesized using equimolar mixtures of nucleoside phosphoramidite 
derivatives. Restriction endonucleases were obtained commercially and were used 
according to the suppliers* instructions. Standard molecular cloning methods were 
employed (Maniatis, et al., 1982). 

Plasmid construction. The vector for the insertion of random sequences, 
pBDL, is a modification of pBR322 containing within the ampicillin gene two 
unique restriction sites located on either side of the nucleotides coding for 
Ser-70. pBDL was assembled by iigating together two modified segments of 
pBR322 (FIGURE 4). The first segment was obtained from the plasmid pBR322-R, 
a generous gift of Dr. J.H. Richards (California Institute of Technology, Pasadena, 
CA). It is a modification of pBR322 that contains an Ava I and a Sea I site 
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centered at positions 3972 and 3937, respectively, as well as an additional Aval 
site at 1425 (Schultz & Richards, 1986). Digestion with Pst I and Sph I yielded a 
fragment of 1315 bp that contained a portion of the ampicillin gene and lacked the 
second Aval site. 

The second segment was obtained by the following steps: digestion of wild- 
type pBR322 with Ava I and Pvu II, purification of the large fragment by 
electrophoresis and eiectroelution, filling in the Ava I termini with E. coli Pol I, 
and blunt end ligation with T4 DNA ligase. The resultant 3725 bp plasmid was also 
digested with Sph I and Pst I, and the 2410 bp fragment was purified by 
electrophoresis and ligated onto the first segment. 

Synthesis of Random Oligonucleotides. The double-stranded oligonucleotide 
used in the construction of the nonproducer strain, pBNP, was synthesized by 
hybridizing 200 ng of 9-mer primer, 5'-AGCAGTACT-3% to 1 ug of the single- 
stranded oligonucleotide template, 5'-CGCCCCGAGGAACGT (N)23 AGTACT- 
GCT-3% in 20 mM Tris-HCl (pH 7.5), 10 mM MgCl2, 50 mM NaCl, and 1 mM 
dithiothreitol at 65*^C for 10 min This template-primer was extended with the 
large fragment of E. coli Pol I, digested with Ava I and Sea I and purified by 
polyacyiamide gel electrophoresis. The double-stranded oligonucleotide for the 
construction of the plasmid used in selecting new mutants was synthesized by a 
similar protocol: 200 ng of 9-mer primer 5'-AGC AGTACT-3^ was hybridized to 
1 ug of the template oligonucleotide: 

5' -CGCCCCGAGGAACGTTTT (N)^ AGO (N)^ AAAGTACTGCT-3\ 
The template-primer was extended with E, coli Pol I, digested with Ava I and 
Sea I, and used as a replacement for the insert in the nonproducer strain. 

Other Methods. The preparation of competent DH5a and DH5 E. coli and 
subsequent transformation with plasmid DMA were carried out according to the 
protocol of Hanahan (1983). Transf ormants were grown in *^SOC" medium for 1 hr 
prior to antibiotic selection. We scored for carbenicillin resistance by incubating 
the transfected E. coll in petri dishes in agar containing 12.5 yg/mi tetracycline 
and 50 pg/ml carbenicillin at 30"C for 48 hrs. Plasmid DNA was purified by the 
alkaline lysis method (Maniatis, et al., 1982), and sequencing of both strands was 
carried out on doubie-stranded DNA purified by isopicnic density centrif ugation 
using dideoxy chain termination {Sanger, et al., 1977). 3-lactamase activity was 
scored using the chromogenic cephalosporin: pyridinium-2-azo-p-di methyl aniline 
chromophore (PADAC) (Kobayashi, et al., 1988). An overnight culture of each^ 
^est mutant was diluted 1,000-fold with the fresh broth, and a 5 ul inoculum (10 
CFU per spot) of each sample was applied onto agar plates containing 50 uM 
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PADAC and 12.5 ug/ml tetracycline. After 18-20 hr incubation at SO^'C the 
diameter of the PAD AC hydrolysis zone formed around the colony was 
determined. The highest antibiotic concentration permissible for growth of £. coZt 
was determined using antibiotic concentration gradients generated in L-agar 
5 plates (Schultz, 1987). 

Results 

In order to substitute random DNA sequences for designated nucleotides at 
the active site of the 0-iactamase gene we first constructed a derivative of 
plasmid pBR322 that contains two unique restriction sites flanking the targeted 

10 sequence (see Experimental Procedures). The new DNA vector contained both the 
tetracycline and ampicillin resistance genes. Within the ampicillin resistance 
gene are Ava I and Sea I sites centered at positions 3329 and 3294, respectively. 
Since it is likely that only a small fraction of random nucleotide sequences at the 
active site of 3-lactamase code for viable amino acid substitutions, and since it is 

15 difficult to completely cleave restriction sites when present at the ends of double- 
stranded DNA (Horwitz Loeb, 1986), we designed a two step strategy to 
minimize contamination with wild-type sequences. We first constructed a 
nonproducer plasmid, pBNP, that contains an inactive nucleotide replacement, 
obtained DNA from isolated clones, and then exchanged the inactive sequence 

20 with a random nucleotide sequence to be used for mutant selection. 

The nonproducer was obtained by digesting the modified pBR322 (FIGURE 4) 
with Ava I and Sea I and inserting a double-stranded 47 oligomer that contains a 
23 base random nucleotide sequence (FIGURE 5A). Plasmid infected E. coli were 
selected based on resistance to tetracycline and then screened for sensitivity to 

25 low concentrations of carbenicillin (10 ug/ml). A tet^ clone was selected that was 
sensitive to all of the analogs tested and exhibited no detectable 6-lactamase 
activity (FIGURE 6), DNA sequence analysis demonstrated that this clone 
contains a 23 base pair insert between Arg-66 and Val-74 (the Ava I and Sea I 
sites) and codes for an amino acid sequence having no homology with the wild type 

30 sequence and also containing a single base frameshift at the carboxy terminus of 
the insert. 

For the construction of active site mutants, DNA from the nonproducer 
strain was digested with the same two restriction enzymes and purified by agarose 
gel electrophoresis. Into this DNA was ligated an oligonucleotide containing an 
35 ^AGC Godon for serine flanked by stretches of nine and six random nucleotides 
(FIGURE 5B). Nonselective growth in the presence of tetracycline in four 
separate experiments yielded a total of approximately 2 x 10^ colonies. Seven of 



the tetracycline resistant colonies were also resistant to earbeniciUin (50 yg/ml) 
(FIGURE 6, mutants 1-7). Each of the earbeniciUin resistant clones contained the 
AGC codon at amino acid position 70. The distribution of nucleotides within the 
random positions in the mutants is: 39% C, 28% T, 23% A, and 10% G, The less 
than equal representation of G in the coding strand of the insert could be the 
result of a bias in the incorporation of nucleotides during chemical synthesis or 
could be indicative of the repertoire of substitutions that yield active molecules. 
Considering- ail seven mutants, 53% of the nucleotides in the random positions do 
not correspond to those in the parent strain. Of the 35 codons capable of being 
substituted, 32 contained nucleotide substitutions; 10 of these were silent, and 22 
resulted in amino acid changes. Of the five substituted amino acids, the proline at 
position 67 is the most conserved; however, it is still lacking in one of the new 
sequences. 

The resistance of coli infected with each of the mutant DNAs was 
quantified by observing the extent of bacterial growth on agar plates containing 
concentration gradients of earbeniciUin, ampicillin or benzylpenicillin. At 30°C, 
the control strain, pBDL, was resistant to each of the antibiotics at >5G0 yg/ml, 
while the nonproducer strain, pBNP failed to grow at the lowest concentration in 
the gradient. The extent of resistance was confirmed by using agar plates 
containing defined concentrations of each of the antibiotics. At 30^C, mutants 2, 
3, 4, and 6 rendered K, coli resistant to the highest concentration of antibiotic 
tested. However, at the elevated temperatures, all of the mutants were more 
sensitive to the antibiotics than was the control, pBDL, Mutant #7 was the most 
sensitive to each of the antibiotics. All the other mutants are more resistant 
towards earbeniciUin than ampicillin or benzylpenicillin, particularly at higher 
temperatures. A chromogenic cephalosporin, PADAG, was used to measure 6- 
lactamase activity of each of the mutant-infected E. coli. In general, the 
resistance of the mutants to earbeniciUin and the other s-lactam antibiotics 
parallels the production by e-lactamase as measured by the hydrolysis of 
PADAC. However, differences in the relative resistance of the mutants to the 
different analogs suggest that some of the s-lactamase mutants exhibit altered 
substrate specificity. 

To unambiguously establish that the replacement at the active site was 
responsible for the earbeniciUin resistance in piasmid-inf ected E. coli, we 
^reassembled one of the new mutants. We chemically synthesized a double- 
stranded oligonucleotide identical to the sequence of nucleotides in mutant #1 
(FIGURE 6). This oligonucleotide was used to replace the insert in the 



nonproducer strain, A comparison of the drug resistance of E. coZi infected with 
mutant #1 containing the biologically selected random sequence and that bearing 
the chemically synthesized insert (Mutant IR) is included in the following 
Table 3. The pattern of resistance is identical. Thus, antibiotic resistance is 
conferred by the substitution at the active site and not by some other mutation 
within the plasmid or within the E. coli chromosome. 

TABLE 3 

Maximum level of resistance and PADAC hydrolysis zone of strains with 

mutations at B-lactamase 



Maximal antibiotic concentration (mg liter) Diameter of 

permitting bacterial growth PADAC hyd- 

rolysis zone 



Strain/Mutant 


Carbenicillin 


Ampicillin 




Benz. 


Penicillin 


(mm) 
30° 


30*" 


37*" 


42° 


30° 


37° 


42° 


30° 


37° 


42° 


Wild Type 


>500 


>500 


>500 


>500 


>500 


>500 


>500 


>500 


>500 


20 


Non Producer 


NG 


NG 


NG 


NG 


NG 


NG 


NG 


NG 


NG 


<4 


Mutant #1 


>500 


75 


NG 


>500 


100 


NG 


250 


NG 


NG 


<4 


Mutant #1R 


>500 


75 


NG 


>500 


100 


NG 


250 


NG 


NG 


<4 


Mutant #2 


>500 


>500 


200 


>500 


>500 


80 


>500 


>500 


50 


14 


Mutant #3 


>500 


>500 


70 


>50p 


160 


NG 


>500 


NG 


NG 


10 


Mutant #4 


>500 


>500 


220 


>500 


330 


75 


>500 


40 


NG 


6 


Mutant #5 


320 


83 


NG 


400 


55 


NG 


250 


NG 


NG 


<4 


Mutant #6 


>500 


>500 


75 


>500 


420 


NG 


>500 


170 


NG 


6 


Mutant #7 


110 


NG 


NG 


310 


100 


NG 


190 


NG 


NG 


<4 



Antibiotic resistance was determined by concentration gradients, as 
described in Experimental Procedures. In Table 3, NG = no growth observed. 
PADAC hydrolysis of less than 4 mm in diameter could not be visualized due to 
growth of bacteria. One of the mutant sequences was duplicated into a second 
plasmid to rule out mutations outside the site of random sequence insertion. The 
insert of mutant #1 was reconstructed in plasmid pBNP by ligating into it a double 
^stranded oligonucleotide: 47-mer 5^-CGCCCCG AGG AACGTTTTC CCGTC AT- 
GAGCATCATCAAAGTACTGCT-3\ and its complement. The double stranded 
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oligo was constructed as described in Experimental Procedures, and then dig'ested 
with Ava I and Sea I before ligation. 

Discussion 

We have replaced nucleotide sequences within the active site of the 6- 
lactamase gene with random chemically synthesized DMA sequences, and selected 
from a heterogeneous population those sequences that render coli resistant to 
carbenicillin. The nucleotide sequence of each of the new mutants has many 
differences from that of the parental plasmid and from any natural 6-lactamases 
so far reported (Bush, 1988; Brenner, 1988; Kelly, et aU, 1980; Spratt & Cromie, 
1988; Nicholas ic Strominger, 1988). This supports the notion that the active sites 
of enzymes may be more flexible than is generally recognized. As designed, each 
of the mutants maintained the codon for Ser-70 present within the oligonucleotide 
insert. Brenner (1988) hypothesized that the Ser-70 in S-lactamases evolved from 
an ancestral enzyme containing cysteine and, in fact, substitution of Ser-70 with 
cysteine yields a B-Iactamase with 1-2% of the activity of the wild type parent 
(Sigal^ et al., 1982)» In six out of the seven mutants we have obtained, the most 
conserved amino acid is Pro-67 followed by Thr-71 (four of seven). Even though 
Thr-71 is conserved in Class-A s-lactamases, it has been reported that 14 out of 
19 single amino acid replacements at this site yield active enzyme (Schultz 6c 
Richards, 1986). Our finding that four of the seven random mutants contain 
codons other than Thr confirms that threonine is not essential for catalysis. 

The amino acids within the active site that determine the substrate 
specificity of g-laetamase are unknown. The differences in resistance to B-lactam 
antibiotics among the mutants we have obtained suggest that some of the mutants 
have an altered substrate specificity (Table 3). The region extending from Ser-70 
to Lys-73 may not be involved in substrate recognition; this region is conserved in 
natural B-lactamases that exhibit differences in substrate specificity (Dale, et al., 
1985; Spratt & Cromie, 1988; Nicholas 6c Strominger, 1988). Alternatively, 
mutations within the region from Pro-66 to Ser-70 may be those responsible for 
alterations in substrate specificity. However, this sequence from Ser-70 to I#ys-73 
may be required for enzyme stability. In the studies of Schultz and Richards, 
most — but not all — substitutions for Thr-71 resulted in decreased activity at 
elevated temperatures, and evidence was presented that this resulted from 
increased proteolysis (Schultz 6c Richards, 1986). We observed a similar thermal 
^lability of mutants selected from the active-site inserts that contained random 
nucleotide sequences in this region. Further studies will be required to determine 
which substitutions reduce the stability of the 6-iactam resistant phenotype and if 
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this thermal inactivation results from enzyme denaturation or increased 
susceptibility to proteolysis. Most of the mutants exhibited a greater resistance 
towards carbenicillin than the other analogs, particularly at 40*^0 (Table 3), and 
this could be the result of selection by carbenicillin. Alternatively, carbenicillin 
could protect against thermal denaturation by preferentially binding to 
6-lactamases; the K^^ for carbenicillin, ampicillin, and benzylpenicillin have been 
reported to be 10, 31, and 21 yM, respectively (Labia, et al., 1979). 

This use of random DNA for the generation of new mutants is based on the 
hypothesis that multiple amino acid substitutions can be tolerated within the 
active sites of enzymes, and that many of these substitutions could yield enzymes 
with altered or even new catalytic activities. Consider a chronology of selective 
prebiotic evolutionary events that might offer advantages to sequences with the 
best fit. Assume that an average gene was initially coded by 2000 nucleotides and 
thus was selected for from a reservoir of 4^000 pc>ssit)ie permutations. Early steps 
in selection could involve DNA and RNA structure, replication, and transcription 
(Orgel, 1986; Eigen dc Schuster, 1977). Selections based on protein structure and 
specificity of catalysis are likely to be relatively late events, and might be limited 
by the interdependence of metabolic pathways and by the stringencies of protein^ 
protein interactions in multicomponent systems. A smaller number of possible 
permutations would be obtained if genes were assembled in units on the basis of 
structural domains (Schultz, et al., 1987; Savageau, 1986). In either case, as a 
consequence of progressive selective processes, a large number of the potential 
nucleotide arrangements that were eliminated early in evolution may nevertheless 
code for active enzymes. 

The overall frequency of multiple amino acid codon substitutions in the 
random collection of S-lactamase mutants is much higher than that found in 
nature* By selecting active genes from random DNA inserts it might be possible 
to circumvent the sequential selective pressures that have occurred during 
evolution. Using a series of small random oligonucleotide inserts we should be 
able to identify most substitutions that yield reactive molecules and thus define 
the topology of the active site on enzymes. Small changes in the structural 
configuration at the active site may have profound influences on the rates and 
specificities of enzymatic reactions and/or thermo- and proteolytic stability. 
New active sequences selected from random DNA inserts might be able to 
I catalyze reactions at a rate greater than that of the native enzymes or might 
utilize unusual substrates and thus be of practical Importance. 
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Sigal, LS., Harwood, B.G, & Arentzen, (1982) Proc. Nati. Acad. Sci., USA 

79, 7156-7160. 

Spratt, B.G. & Cromie, K.D. (1988) Rev. Infect. Diseases 10, 699-711. 

While the present invention has been described in conjunction with the 
preferred embodiments, one of ordinary skill, having read the foregoing specifi- 
cation, will be able to effect various changes, substitutions of equivalents, and 
alterations to the methods set forth herein. It is therefore intended that the 
protection granted by Letters Patent hereon be limited only by the appended 
claims and equivalents thereof. 
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The embodiments of the invention in which an exclusive property or privilege 
is claimed are defined as follows: 

A method of obtaining an oligonucleotide capable of carrying out a 
predeteteiined biological function, comprising: 

(a) generating a heterogeneous pool of oligonucleotides, x + y + z 
nucleotidesXn length, said oligonucleotides comprising a 5* randomized sequence, x 
nucleotides i^i length, a central preselected sequence, y nucleotides in length, and 
a 3* randomized sequence, z nucleotides in length, said heterogeneous pool having 



possible 



nucleic acid se«quences representing a random sampling of the 4* 
sequences for oligbnucleotides of said length, 

(b) introducing a random sampling of said heterogeneous pool of 
oligonucleotides into\a population of cells that do not exhibit the predetermined 
biological function, 

(c) thereafter screening said population of cells for a 
subpopulation of cells exhibmng said pj^ede^ter mined biological function, and 

(d) isolating f 
comprising said preselected 



m s^id subpjopulation of cells an oligonucleotide 
quence /jand capable of carrying out said 



predetermined biological function^ 




2, A method of obtaining an oHgonucleotide capable of carrying out a 
predetermined biological function, comprisfM: 

(a) generating a heterogeneous pool of oligonucleotides, n 
nucleotides in length, from a mixture of nucleo^des consisting essentially of a% 
adenine, t96 thymidine, c% cytosine, and g% guanin^ wherein a-^t + c+ g= 10096, 
said heterogeneous pool having nucleic acid sequences representing a random 
sampling of the 4^ possible sequences for oligonucleotid^of said length generated 
from nucleotides of said relative percent concentrations, 

(b) introducing a random sampling of said lihterogeneous pool of 
oligonucleotides into a population of cells that do not exhibit ti5,e predetermined 
biological function, 

(c) thereafter screening said population of caUs for a 
subpopulation of cells exhibiting said predetermined biological function, and 

(d) isolating from said subpopulation of cells an oligonucleotide 
f capable of carrying out said predetermined biological function. 
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METHOD FOR PRODUCING NOVEL DMA SEQUENCES / 

WITH BIOLOGICAL ACTIVITY 

Abstract of the Disclosure 
A method of obtaining an oligonucleotide capable of carrying^ out a 
predetermined biological function. A heterogeneous pool of oligonucleotides, 
X + y + z nucleotides in length, is first generated. Each oligonucleotide has a 5^ 
randomized sequence, x nucleotides in length, a central preselected sequence, y 
nucleotides in length, and a 3' randomized sequence, z nucleotides in length. The 
resulting heterogeneous pool contains nucleic acid sequences representing a 
random sampling of the 4,'^'^^ possible sequences for oligonucleotides of the stated 
length. A random sampling of the heterogeneous pool of oligonucleotides is 
introduced into a population of ceils that do not exhibit the predetermined 
biological function* The population of engineered cells is then screened for a 
subpopulation of cells exhibiting the predetermined biological function. From that 
subpopuiation of cells is isolated an oligonucleotide containing the preselected 
sequence and capable of carrying out the predetermined biological function. 
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Step I: Selection of Non-Producer Strain 



CGCCGAGGAA-/S/ -AGTAGTGCT 

23 TCATGACGA 



1, Poll +dNTPs 

2. AvaJ/ Seal 



5'-CCGAGGAA -/\^^- AGT 

COTT-N -TO A 
23 



Seal 



Aval + Seal 




Amp/Tet 



Step II: Selection for Carbentcillin Resistance 

5-CGCCCCGAGGAACGTTTT-/'yV7^ - AGO -//y/^ -AAAGTACTGCT 



1 . Poll + dNTPs 

2, Aval/ Seal 



CCG AGGAACGTTTT- ////^ - AGO • [N]^ AAAGT 
CGTTGCAAAA- /"A//^ - TOG ^/W/^ TTTGA 



Aval 



Seal 




Aval 



Seal 




Tet 



Random 
Insert 
Library 



Aval + Seal 



Carb/Tet 
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Notice of AlIowabilltY 



Application No. 

08/316,415 



Appiicant(s) 



Dube et al. 



Examiner 

Scott D. Priebo, Ph.D. 



Group Art Unit 
1632 




All claims being allowable, PROSECUTION ON THE MERITS IS (OR REMAINS) CLOSED in this application. If not included 
herewith (or previously mailed), a Notice of Allowance and Issue Fee Due or other appropriate communication will be 
mailed in due course. 



Ki This communication is responsive to the amendment fifed 12/2/97 

[El The allowed ciaim(s) is/are 1-3 

□ The drawings filed on are acceptable. 

□ Acknowledgement is made of a claim for foreign priority under 35 U.S.C. § 1 19{a)-{d). 

□ All □ Some* □ None of the CERTIFIED copies of the priority documents have been 

□ received. 

n received in Application No. (Series Code/Serial Number) . 

□ received in this national stage application from the International Bureau {PCT Rule 17.2(a)). 
* Certified copies not received; 



□ Acl<nowledgement is made of a claim for domestic priority under 35 U.S.C. § 1 19(e). 

A SHORTENED STATUTORY PERIOD FOR RESPONSE to comply with the requirements noted below is set to EXPIRE 
THREE MONTHS FROM THE "DATE MAILED" of this Office action. Failure to timely comply will result in 
ABANDONMENT of this application. Extensions of time may be obtained under the provisions of 37 CFR 1.136(a). 

D Note the attached EXAMINER'S AMENDMENT or NOTICE OF INFORMAL APPLICATION, PTO-152, which discloses 
that the oath or declaration is deficient. A SUBSTITUTE OATH OR DECLARATION IS REQUIRED. 

[S Applicant MUST submit NEW FORMAL DRAWINGS 

□ because the originally filed drawings were declared by applicant to be informal. 

including changes required by the Notice of Draftsperson's Patent Drawing Review, PTO-948, attached hereto or 
to Paper No. 4 . 



Q including changes required by the proposed drawing correction filed on 

approved by the examiner. 

□ including changes required by the attached Examiner's Amendment/Comment. 



, which has been 



Identifying indicia such as the application number (see 37 CFR 1.84(c)) should be written on the reverse side of the 
drawings. The drawings should be filed as a separate paper with a transmittal lettter addressed to the Offictai 
Draftsperson. 

□ Note the attached Examiner's comment regarding REQUIREMENT FOR THE DEPOSIT OF BIOLOGICAL MATERIAL. 

Any response to this letter should include, in the upper right hand corner, the APPLICATION NUMBER (SERIES 
CODE/SERIAL NUMBER). If applicant has received a Notice of Allowance and Issue Fee Due, the ISSUE BATCH NUMBER 
and DATE of the NOTICE OF ALLOWANCE should also be included. 

Attachment(s) 

□ Notice of References Cited, PTO-892 

Q Information Disclosure Statement(s), PTO-1449, Paper No(s). 

□ Notice of Draftsperson *s Patent Drawing Review, PTO-948 

□ Notice c|f Informal Patent Application, PTO-152 
Interview Summary, PTO-413 
Examiner's Amendment/Comment 



□ Examiner's Comment Regarding Requirement for Deposit of Biological Material 
Kl Examiner's Statement of Reasons for Allowance 



Serial Number: 08/3 1 6,4 1 5 
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Art Unit: 1632 



EXAMINER'S AMENDMENT 



An examiner's amendment to the record appears below. Should the changes and/or 
additions be unacceptable to applicant, an amendment may be filed as provided by 37 CFR 1.312. 
To ensure consideration of such an amendment, it MUST be submitted no later than the payment 
of the issue fee. 

Authorization for this examiner*s amendment was given in a telephone interview with 
Carol Nottenburg on February 18, 1998. 

The application has been amended as follows: 



In claim 1: in the last line of (fe^^nd (c)v^he commas were replaced with semi-colons and 
in the last line of (d), "a biologically active polypeptide" was replaced with —a biological 
activity of a polyp^tide— . 



In claim 2: in the last line of (by^d the commas were replaced with semi-colons ~;~ and 
in the last line of (d), "a biologically active polypeptide" was replaced with —a biological 
activity of a polypeptide--. 



in the last line of (d), "a biologically active polypeptide" was replaced with —a biological 
activity of a polypeptide--. 





In claim 3 : in the last line of (b) and (c), the commas 



were replaced with semi-colons — ;-- and 



• 
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Art Unit: 1632 



The following is an examiner's statement of reasons for allowance: 

In view of the papers filed 6/2/97, it has been found that this non-provisional application, 
as filed, through error and without deceptive intent, improperly set forth the inventorship, and 
accordingly, this application has been corrected in compliance with 37 CFR 1.48(a). The 
inventorship of this application has been changed by deletion of Deepak K. Dube. 

Therefore, the declaration filed on 12/1 1/95 under 37 CFR 1. 13 1 is sufficient to overcome 
the Childs et al. (UCLA Symp. Molec. Cell. Biol. 30: 341-350, 1986) reference. Consequently, 
the rejection of claims 1-3 under 35 U.S.C. § 103 is withdrawn. 

Any comments considered necessary by applicant must be submitted no later than the 
payment of the issue fee and, to avoid processing delays, should preferably accompany the issue 
fee. Such submissions should be clearly labeled "Comments on Statement of Reasons for 
Allowance." 



Certain papers related to this application may be submitted to Art Unit 1632 by facsimile 
transmission. The FAX number is (703) 308-4242 or 305-3014. The faxing of such papers must 
conform with the notices published in the Official Gazette, 1 156 OG 61 (November 16, 1993) and 
1 157 OG 94 (December 28, 1993) (see 37 CFR 1.6(d)). NOTE: If applicant does submit a paper 
by FAX, the original copy should be retained by applicant or applicant's representative. NO 
DUPLICATE COPIES SHOULD BE SLJBMITTED, so as to avoid the processing of duplicate 
papers in the Office. 

Any inquiry concerning this communication or earlier communications fi-om the examiner 
should be directed to Scott D. Priebe whose telephone number is (703) 308-73 10. The examiner 
can r|ormally be reached on Monday through Friday fi-om 9 AM to 5 PM. If attempts to reach the 
examiner by telephone are unsuccessfiai, the examiner's supervisor, Jasemine Chambers, Ph.D., 
can be reached on (703) 308-2035. 
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Art Unit: 1632 



Any inquiry of a general nature or relating to the status of this application should be 
directed to the Grroup receptionist whose telephone number is (703) 308-0196. 

Scott D. Priebe, Ph.D. February 18, 1998 

Patent Examiner 
Art Unit 1632 
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COMBINED DECLARATION AND POWER OF ATTORNEY 

IN PATENT APPLICATION 

Attorney Docket No. UOFW-1-4539 

As a below named inventor, I hereby declare that: 

my residence, post office address and citizenship are as stated below next to 
my name; 

I believe that I am a joint inventor of the subject matter that is claimed and 
for which patent is sought on the invention entitled METHOD FOR PRODUCING 
NOVEL DNA SEQUENCES WITH BIOLOGICAL ACTIVITY, the specification of 
which 

(check one) is attached hereto. 

X was filed on June 19, 1989, as Application Serial No* 
07/368,674. 

I hereby state that I have reviewed and understand the contents of the above 
identified specification, including the claims, as amended by any amendment 
referred to above. 

I acknowledge the duty to disclose information which is material to the 
examination of this application in accordance with Title 37, Code of Federal 
Regulations, Section 1.56(a). 

I hereby claim the benefit under Title 35, United States Code, Section 120 of 
any United States application(s) listed below, and, insofar as the subject matter of 
each of the claims of this application is not disclosed in the prior United States 
application in the manner provided by the first paragraph of Title 35, United 
States Code, Section 112, I acknowledge the duty to disclose material information 
as defined in Title 37, Code of Federal Regulations, Section 1, 56(a) which 
occurred between the filing date of the prior application and the national or PCT 
international filing date of this application: 

Status (patented, 

Serial No> Filing Date pending, a bandoned) 
06/887,070 July 17. 1986 pe nding 



I hereby appoint the following attorneys and/or agents to prosecute this 
application and to transact all business in the United States Patent and Trademark 
Office connected therewith: Bruce E, O'Connor, Reg. No, 24,849; Lee E. Johnson, 
Reg. No. 22,946; Gary S. Kindness, Reg. No. 22,178; James W. Anable, 
Reg. No. 26,827; James R. Uhlir, Reg. No. 25,096; Jerald E. Nagae, 
Rag. No. 29,418; Michael W, Bocianowski, Reg. No. 28,692; Michael G. Toner, 
Rig. No, 27,601; John B. Mason, Reg. No. 29,563; Thomas F. Broderick, Reg. 
No. 31,332; Dennis K. Sheiton, Reg. No. 26,997; Robert R. Cook, Reg. No. 31,602; 



1 of 2 



and the firm of Christensen, 0*Connor, Johnson 6c Kindness, Address ail telephone 
calls to Thomas F. Broderick at telephone No. (206) 441-8780. 



Address ail correspondence to: 



CHRISTENSEN, O'CONNOR, JOHNSON & KINDNESS 

2700 Westin Building 
2001 Sixth Avenue 
Seattle, WA 98121 

I hereby further declare that ail statements made herein of my own 
knowledge are true and that all statements made on information and belief are 
believed to be true; and further that these statements were made with the 
knowledge that willful false statements and the like so made are punishable by 
fine or imprisonment, or both, under Section 1001 of Title 18 of the United States 
Code, and that such willful false statements may jeopardize the validity of the 
application or any patent issued thereon. 



I -00 



Full Name of Inventor 
Dipak Dahte^ 



Citizenship 

India 



Residence 

6527 - 25th Avenue Northeast, Seattle, Was hington 981 15 

_ _ . _ . _ _ III — I . _ ■ .J _ — "~~ 

Post Office Address 
Same as above. 



Inventor's Signature 



^ . Date 



0 



,) 



Mi 




Citizenship 
U.S. 



Full Name of Inventor 
Marshall S^ Hoxwltz, 
Residence" 

925 - 8Qth Avenue Northeast, Bellevue, Washington 98004 

Post Office Address 
Same as above. 

Inveptpr's Signjtture 




Date 



Full Name of Inventor 

Lawrence Loefe— ^ 



Citizenship 
U.S. 



Residence 

13808 Northeast 36th Place, Bellevue, Washington 98005 

Post Office Address 
Same as above. 





y.Date 



Express Mail 
TB337417234 



PATENT 

I hereby certify that on the date specified below, this correspondence is being 
deposited with the United States Postal Service as first-class mail in an envelope 
addressed to: Conmiissioner of Patents and Trademarks, Washington, DC 2023 1. 

Date David D. McMasters 



THE UNITED STATES PATENT AND TRADEMARK OFHCE 



Applicants : Dipak K. Dube, Marshall S. Horwitz, and Lawrence A Loeb 

Serial No. : 08/105,108 

FUed : August 11, 1993 

For : METHOD FOR PRODUCING NOVEL DNA SEQUENCES 

WITH BIOLOGICAL ACTIVITY 

Art Unit : 1805 

Docket No. : 240052.408C3 

Date : November 10, 1993 



Commissioner of Patents and Trademarics 
Washu^n, DC 2023 1 

REVOCATION AND SUBSTITUTE POWER OF ATTORNEY 

Sin 

In tiie matter of the above-identified application, I hereby revoke all 
power of attorney heretofore given and hereby appoint RICHARD W. SEED, 
Registration No. 16,557; ROBERT J. BAYNHAM, Registration No. 22,846; 
EDWARD W. BULCmS, Registration No. 26,847; GEORGE C. RONDEAU, JR., 
Registration No. 28,893; DAVID H. DEITS, Registration No. 28,066; WILLIAM O. 
FERRON, JR., Registration No. 30,633; PAUL T. MEIKLEJOHN, Registration No. 
26,569; DAVID J. MAKI, Registration No. 31,392; RICHARD G. SHARKEY, 
Registration No. 32,629; GEORGE B. FOX Registration No. 31,510; DAVID V. 
CARLSON, Registration No. 31,153; MAURICE J. PIRIO, Registration No. 33,273; 
KARL R. HERMANNS, Registration No. 33,507; DAVID D. McMASTERS, 
Registration No. 33,963; ROBERT lANNUCCI, Registration No. 33,514; S. KEVIN 
PICKENS, Registration No. 34,696; JOSHUA KING, Registration No. 35,570; 



# 



MICHAEL J, DONOHUE, Registration No, 35,859; LORRAINE LINFORD, 
Registration No. 35,939; KEVIN L CANNING, Registration No, 35,470; LYNETTE 
WYLIE, Registration No, 34,951; PATRICIA E. CHAMBERLAIN, Registration No. 
37,038; CHRISTOPHER L DALEY, Registration No. 34,807; TIMOTHY J. CREAN, 
Registration No. 37,116; STEVEN D. LAWRENZ, Registration No. 37,376; 
ROBERT G. WOOLSTON, Registration No. 37,263; and MANYA S. DEEHR, 
Registration No. 37,120, composing flie firm of SEED and BERRY, 6300 Columbia 
Center, Seattle, Washington 98104-7092, as the principal attomeys with full power of 
substitution, association, and revocation to prosecute said application, to transact all 
business in the Patent and Trademark Office connected therewith, and to receive the 
letters patent therefor. Please direct all future correspondence to: 



and direct all telephone calls to David D. McMasters at (206) 622-4900 and telecopies 
to (206) 682-^031. 



David D. McMasters 

SEED and BERRY 

6300 Columbia Center 

Seattie, Washington 98104-7092 



Date 



John P. E )fe sRo o ie Tj-PhrO, 
OfBce of Technology Transfer, 
University of Washington 




2 



Express Mail 
TB337417234 




COMBINED DECLARATION AND POWER OF ATTORNEY 

IN PATENT APPLICATION 

Attorney Docket No. UOFW-1-4539 

As a below named inventor, I hereby declare that: 

my residence, post office address and citizenship are as stated below next to 
my name; 

I believe that I am a joint inventor of the subject matter that is claimed and 
for which patent is sought on the invention entitled METHOD FOR PRODUCING 
NOVEL DNA SEQUENCES WITH BIOLOGICAL ACTIVITY, the specification of 
which 

(check one) is attached hereto. 

was filed on June 19, 1989, as Application Serial No. 



07/368,674. 

I hereby state that I have reviewed and understand the contents of the above 
identified specification, including the claims, as amended by any amendment 
referred to above. 

I acknowledge the duty to disclose information which is material to the 
examination of this application in accordance with Title 37, Code of Federal 
Regulations, Section 1.56(a). 

I hereby claim the benefit under Title 35, United States Code, Section 120 of 
any United States appiication(s) listed below, and, insofar as the subject matter of 
each of the claims of this application is not disclosed in the prior United States 
application in the manner provided by the first paragraph of Title 35, United 
States Code, Section 112, I acknowledge the duty to disclose material information 
as defined in Title 37, Code of Federal Regulations, Section l-56(a) which 
occurred between the filing date of the prior application and the national or PCT 
internationial filing date of this application: 

Status (patented. 

Serial No. Filing Date pending, a bandoned) 
06/887,070 July 17, 1986 pe nding 



I hereby appoint the following attorneys and/or agents to prosecute this 
application and to transact all business in the United States Patent and Trademark 
Office connected therewith: Bruce E, O'Connor, Reg. No. 24,849; Lee E. Johnson, 
Reg. No. 22,946; Gary S. Kindness, Reg. No, 22,178; James W. Anable, 
Reg. No, 26,827; James R. Uhiir, Reg, No. 25,096; Jerald E. Nagae, 
Reg, No. 29,418; Michael W, Bocianowski, Reg. No. 28,692; Michael G. Toner, 
Reg. No, 27,601; John B. Mason, Reg, No, 29,563; Thomas F, Broderick, Reg. 
No, 31,332; Dennis K. Sheiton, Reg. No, 26,997; Robert R, Cook, Reg. No, 31,602; 
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# 




and the firm of Christensen, O'Connor, Johnson 5c Kindness- Address all telephone 
calls to Thomas Broderick at telephone No. (206) 441-8780. 

Address all correspondence to: 

CHRISTENSEN, O'CONNOR, JOHNSON & KINDNESS 

2700 Westin Building 
2001 Sixth Avenue 
Seattle, WA 98121 

I hereby further declare that all statements made herein of my own 
knowledge are true and that all statements made on information and belief are 
believed to be true; and further that these statements were made^ with the 
knowledge that willful false statements and the like so made are punishable by 
fine or imprisonment, or both, under Section 1001 of Title 18 of the United States 
Code, and that such willful false statements may jeopardize the validity of the 
application or any patent issued thereon- 



Full Name of Inventor Citizenship 
Dipak K, Dube India 



Residence , ... ^ rxr^^Ar- 

6527 - 26th Avenue Northeast^ Seattle, Washington 9811 5 
Post Office Address 

Same as above. 



Inventor's Signature ^ ^Sj^^h 



Full Name of Inventor Citizenship 

Marshall S. Horwitz 



Residence 

925 - 8Qth Avenue Northeast, Bellevue, ' Washington 98004 
Post Office Address 

Same as above, 



Full Name of Inventor Citizenship 

Lawrence A. Loeb 

Residence 

13808 Northeast 36th Place, Bellevue, Washington 98005 

Post Office Address 
Same as above. 



Inve1itor's_SiscHature 





.Date 
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DECLARATION AND POWER OF ATTORNEY 

We, Marshall S. Horwitz and Lawrence A. Loeb, whose application for letters 
patent for an invention entitled "METHOD FOR PRODUCING NOVEL DNA 
SEQUENCES WITH BIOLOGICAL ACTIVITY;* Application No, 08/316,415, which was 
filed in the United States Patent and Trademark OlTice on September 30^ 1994; which was a 
continuation of Application No. 08/105,108, which was filed in the United States Patent and 
Trademark Office on August 11, 1993; which was a continuation of Application No. 
07/881,607, which was filed in the United States Patent and Trademark Offl;:e on May 12, 
1992; which was a continuation of Application No. 07/368,674, which was filed in the 
United States Patent and Trademark Office on June 19, 1989, declare tliat this application 
(liereinafler referred to as "later C-I-P" application) in part discloses and claims subject 
matter disclosed in our earlier-filed Application No. 06/887,070, filed July 17, 1986, which 
was abandoned, and that ail amendments filed in the United States Patent and Trademark 
Office claiming subject matter disclosed \n said earlier application were part of said invention 
and were invented before the filing of said earlier application for said invention. Wc have 
reviewed and understand the contents of this later C-I-P specification, including the claims, 
and we believe we are the original, first, and joint inventors of the invention entitled 
"METHOD FOR PRODUCDsIG NOVEL DNA SEQUENCES WITH BIOLOGICAL 
ACTIVITY^" which is described and claimed in this later C-I-P specification and for which a 
patent is sought. 

We acknowledge our duty to disclose information of which we arc aware 
which is material to the examination of this application in accordance witli 37 CF.R. § 
1 .56(a), including material information which occurred between the filin^j date of said earlier- 
filed pending application and the present. 

Wc hereby appoint RICHARD W, SHED, Registration No. 16,557; 
ROBERT J. BAYNHAM. Registration No. 22,846; EDWARD W. BULCHIS, Registration 
No. 26,847; GEORGE C. RONDEAU. JR., Registration No. 28,893; DAVID H. DEITS, 
Registration No. 28,066; WILLIAM O. FERRON, JR., Registration No. 30,633; PAUL T. 
MEIKLEJOHN, Registration No. 26,569; DAVID L MAKI, Registration No. 31,392; 
RICHARD G. SHARKEY, Registration No. 32,629; DAVID V. CARLSON, Registration 
No. 31J53; MAURICE L PIRIO, Registration No. 33,273; KARL R. HERMANNS, 
Registration No. 33,507; DAVID D. McMASTERS, Registration No, 33,963; ROBERT 
lANNUCCI, Registration No. 33,514; JOSHUA KING, Registration No. 35,570; 
MICHAEL J- DONOHUE, Registration No. 35,859; KEVIN L CANNING, Registration 
No, 35,470; CHRISTOPHER J. DALEY- WATSON, Registration No. 34,807; STEVEN D. 
LAWRENZ, Registration No. 37,376; ROBERTO. WOOLSTON, Registration No. 37,263; 
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CLARENCE T. TEGREENE, Registration No. 37,951; ELLEN M. BTERMAN, Registration 
No. 38,079; BRYAN A. SANTARELLI, Registration No, 37,560; MICHAEL L. KIKLIS, 
Regisimtion No. 38,939; CAROL NOTTBNBURG, Registration No, 39,317; CRAIG S. 
JEPSON, Registration No. 33,517; FAULT. PARKER, Registration No. 38,264; JOHN C, 
STEWART, Registration No. 40,188; ROBERT W. BKRGSTROM, Rcgiiilration 
No, 39,906; DAVID W. PARKER, Registration No. 37,414; ROBERT E. MATES, 
Registration No. 35,271; and BRIAN G, BODiNE, Registration No. P-40,520; comprising 
llie firm of SEED and BERRY, 6300 Columbia Center, Seattle, Washington 98104-7092, as 
our attorneys to prosecute this application and transact all business in the Patent and 
Trademark Office connected therewith. Please direct all future correspondence to: 

Carol Nottcnburg, Ph.D. 
SEED and BERRY 
6300 Columbia Center 
Seattle, Washington 98104-7092 



imd direct all telephone calls to Carol NoUenburg at (206) 622-4900 and telecopies^ to (206) 
682-6031. 

We further declare that all statements made herein of our own knowledge are 
true and that ail statements made on information and belief are believed to be true; and 
further, that these statements were made with the knowledge that tlie making of willfully 
false statements and the like is punishable by fine or imprisonment, or both, imder Section 
1001 of Title 18 of the United States Code, and may jeopardize the validity of any patent 
issuing from this patent application. 



"Marshall S. Horwitz ^ 



Marsh! 




to Mt^4l^fl'^^'^ tJ(tJi,V s ^ 

sidence ' City of Bolle^ntc, C 



Date 

Residence ' : ^ City of Dollc^ntc, toimty of King 

State of Washington 
Citizenship : United States of America 

P.O. Address : 925 80th Avenue Nortlieast 

Bcllevue, Washington 98004 
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Residence 



Citizenship 
P.O. Address 



City of Bellcvue, County of King 

State of Wasliington 

United States of America 

13808 Northeast 36th Place 
Bellevue, Washington 98005 
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